There is a category of production incident that engineering teams are not tracking yet — because it doesn't fit any existing postmortem template. The agent initiated an action. The action was ...
Students in Vincent St-Amour’s new Responsible Software Engineering course are analyzing case studies of software failures and exploring tools and techniques to prevent similar disasters Software ...
Railway Highlights the Importance of Logs, Metrics, Traces, and Alerts for Diagnosing System Failure
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results