
step-04c-subagent-reliability.md 1.8KB

---
name: 'step-04c-subagent-reliability'
description: 'Subagent: Reliability NFR evidence audit'
subagent: true
outputFile: '/tmp/tea-nfr-reliability-{{timestamp}}.json'
---

# Subagent 4C: Reliability NFR Evidence Audit

## SUBAGENT CONTEXT

This is an **isolated subagent** running in parallel with other NFR domain evidence audits.

**Your task:** Assess the RELIABILITY NFR domain only.

---

## SUBAGENT TASK

### 1. Reliability Evidence Audit Categories

**A) Error Handling:**

- Try-catch blocks for critical operations
- Graceful degradation
- Circuit breakers
- Retry mechanisms

**B) Monitoring & Observability:**

- Logging implementation
- Error tracking (Sentry/Datadog)
- Health check endpoints
- Alerting systems

**C) Fault Tolerance:**

- Database failover
- Service redundancy
- Backup strategies
- Disaster recovery plan

**D) Uptime & Availability:**

- SLA targets
- Historical uptime
- Incident response
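
The four categories above could be held as a plain checklist so that every category produces a finding, even when no evidence turns up. A minimal Python sketch follows; the names `RELIABILITY_CHECKLIST` and `empty_findings` are hypothetical and not part of the original workflow, and the `None` placeholder status is an assumption.

```python
# Hypothetical checklist mirroring categories A-D; names are illustrative only.
RELIABILITY_CHECKLIST = {
    "Error Handling": [
        "Try-catch blocks for critical operations",
        "Graceful degradation",
        "Circuit breakers",
        "Retry mechanisms",
    ],
    "Monitoring & Observability": [
        "Logging implementation",
        "Error tracking (Sentry/Datadog)",
        "Health check endpoints",
        "Alerting systems",
    ],
    "Fault Tolerance": [
        "Database failover",
        "Service redundancy",
        "Backup strategies",
        "Disaster recovery plan",
    ],
    "Uptime & Availability": [
        "SLA targets",
        "Historical uptime",
        "Incident response",
    ],
}


def empty_findings(checklist):
    """Return one placeholder finding per category, keyed like the output format."""
    return [
        {
            "category": category,
            "status": None,  # assumption: filled in once the audit has run
            "description": "",
            "evidence": [],
            "recommendations": [],
        }
        for category in checklist
    ]
```

Starting from a skeleton like this makes a skipped category visible as a finding whose status is still unset.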

---

## OUTPUT FORMAT

```json
{
  "domain": "reliability",
  "risk_level": "LOW",
  "findings": [
    {
      "category": "Error Handling",
      "status": "PASS",
      "description": "Comprehensive error handling with circuit breakers",
      "evidence": ["Circuit breaker pattern in src/services/", "Retry logic implemented"],
      "recommendations": []
    },
    {
      "category": "Monitoring",
      "status": "CONCERN",
      "description": "No APM (Application Performance Monitoring) tool",
      "evidence": ["Logging present but no distributed tracing"],
      "recommendations": ["Implement APM (Datadog/New Relic)", "Add distributed tracing"]
    }
  ],
  "compliance": {
    "SLA_99.9": "PASS"
  },
  "priority_actions": ["Implement APM for better observability"],
  "summary": "Reliability is good with minor monitoring gaps"
}
```
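
The shape of this output can be checked mechanically before it is handed back. The sketch below is in Python and assumes that every key in the example is required and has the type shown there. It deliberately does not restrict `status` or `risk_level` to particular values, since the example only shows `PASS`, `CONCERN`, and `LOW`, and the full sets are not stated. The function name `validate_output` is hypothetical.

```python
# Expected keys and types, read off the example output (an assumption, not a spec).
REQUIRED_TOP_LEVEL = {
    "domain": str,
    "risk_level": str,
    "findings": list,
    "compliance": dict,
    "priority_actions": list,
    "summary": str,
}
REQUIRED_FINDING = {
    "category": str,
    "status": str,
    "description": str,
    "evidence": list,
    "recommendations": list,
}


def validate_output(report):
    """Return a list of problems; an empty list means the shape matches the example."""
    problems = []
    for key, expected in REQUIRED_TOP_LEVEL.items():
        if key not in report:
            problems.append(f"missing key: {key}")
        elif not isinstance(report[key], expected):
            problems.append(f"{key} should be {expected.__name__}")
    if "domain" in report and report["domain"] != "reliability":
        problems.append("domain should be 'reliability'")
    findings = report.get("findings")
    if isinstance(findings, list):
        for index, finding in enumerate(findings):
            if not isinstance(finding, dict):
                problems.append(f"findings[{index}] should be an object")
                continue
            for key, expected in REQUIRED_FINDING.items():
                if key not in finding:
                    problems.append(f"findings[{index}] missing key: {key}")
                elif not isinstance(finding[key], expected):
                    problems.append(f"findings[{index}].{key} should be {expected.__name__}")
    return problems
```

Returning a list of problems rather than raising on the first one lets the subagent report every gap in a single pass.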

---

## EXIT CONDITION

The subagent completes when the JSON output has been written to the temp file.
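
Because completion is signalled by the file's existence, it may help to write the file atomically so that a parent process never reads a half-written report. The Python sketch below shows one way to do that. The `outputFile` template comes from the front matter; the timestamp format, the helper names `resolve_output_path` and `write_output`, and the write-then-rename approach are assumptions.

```python
import json
import os
import tempfile
from datetime import datetime, timezone

# Template taken from the front matter's outputFile field.
OUTPUT_TEMPLATE = "/tmp/tea-nfr-reliability-{{timestamp}}.json"


def resolve_output_path(template, timestamp=None):
    """Fill in the {{timestamp}} placeholder; the format used here is an assumption."""
    if timestamp is None:
        timestamp = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%SZ")
    return template.replace("{{timestamp}}", timestamp)


def write_output(report, path):
    """Write the report to a scratch file in the same directory, then rename it into place."""
    directory = os.path.dirname(path) or "."
    fd, scratch_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as handle:
        json.dump(report, handle, indent=2)
    # os.replace is atomic when source and destination are on the same filesystem.
    os.replace(scratch_path, path)
    return path
```

A caller would run `write_output(report, resolve_output_path(OUTPUT_TEMPLATE))` as the subagent's last step.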