-
Notifications
You must be signed in to change notification settings - Fork 270
Adding Additional Failure Patterns for Alerts #3663
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
…verify if the patterns are detected
AviralGoelAMD
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull request overview
This PR enhances failure detection in Jenkins pipelines by adding new error patterns and refining existing ones. The changes improve the monitoring and alerting capabilities for Docker authentication issues and sccache startup failures.
Changes:
- Added two new failure patterns to detect Docker Personal Access Token authentication failures and sccache server startup errors
- Simplified two existing regex patterns by removing unnecessary capture groups
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
| def failurePatterns = [ | ||
| [pattern: /login attempt to .* failed with status: 401 Unauthorized/, description: "Docker registry authentication failed"], | ||
| [pattern: /(.*)docker login failed(.*)/, description: "Docker login failed"], | ||
| [pattern: /.*docker login failed.*/, description: "Docker login failed"], |
Copilot
AI
Jan 28, 2026
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This pattern is too generic and will match the more specific Docker authentication patterns on lines 41 and 47. Consider either removing this pattern since it's now redundant, or making it more specific to catch different Docker login failure scenarios not covered by the other patterns.
| [pattern: /.*docker login failed.*/, description: "Docker login failed"], | |
| [pattern: /^.*docker login failed(?!.*(status: 401 Unauthorized|unauthorized: your account must log in with a Personal Access Token)).*$/, description: "Docker login failed"], |
| [pattern: /.*GPU not found.*/, description: "GPU not found"], | ||
| [pattern: /Could not connect to Redis at .* Connection timed out/, description: "Redis connection timed out"], | ||
| [pattern: /.*unauthorized: your account must log in with a Personal Access Token.*/, description: "Docker login failed"], | ||
| [pattern: /.*sccache: error: Server startup failed: Address in use.*/, description: "Sccache Error"] |
Copilot
AI
Jan 28, 2026
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The description 'Sccache Error' is inconsistent with the capitalization and naming style of other error descriptions in this list. Consider renaming to 'sccache server startup failed' to match the pattern of descriptive, lowercase error messages used elsewhere (e.g., 'Docker registry authentication failed', 'Redis connection timed out').
| [pattern: /.*sccache: error: Server startup failed: Address in use.*/, description: "Sccache Error"] | |
| [pattern: /.*sccache: error: Server startup failed: Address in use.*/, description: "sccache server startup failed"] |
Proposed changes
How was this tested
I added a temporary test function to log failure messages and force the pipeline to fail. Once failed I verified that the test logs were captured and alerts were sent.