BOR-519: Implement ordered event parser pipeline by bdclaw2026 · Pull Request #28 · STRRL/lapp

bdclaw2026 · 2026-03-11T15:58:32Z

Summary

add an ordered raw-line parser pipeline in pkg/event
support JSON, quoted logfmt, generic key=value, prefix extraction, and plain-text fallback
add parser tests for precedence, fallback behavior, and raw-line preservation

Validation

make unit-test
make lint

gemini-code-assist · 2026-03-11T15:59:29Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a robust and extensible event parsing mechanism, centralizing the logic for converting diverse log line formats into a standardized Event structure. This enhancement significantly improves the system's ability to ingest and process various log sources consistently, laying the groundwork for more reliable data analysis and monitoring.

Highlights

Event Parsing Pipeline: Implemented an ordered raw-line parser pipeline within the pkg/event package.
Supported Formats: Added support for parsing JSON, quoted logfmt, generic key=value pairs, prefix extraction, and plain-text fallback.
Testing: Included comprehensive parser tests to validate precedence, fallback behavior, and raw-line preservation.

Changelog

pkg/event/parser.go
- Implemented the ParseLine function which orchestrates the parsing process through a chain of specialized parsers.
- Defined jsonLineParser, logfmtLineParser, keyValueLineParser, prefixLineParser, and plainTextLineParser to handle different log formats.
- Included utility functions for timestamp parsing, level canonicalization, and key-value assignment.
pkg/event/parser_test.go
- Added TestParseLine_JSON to verify JSON parsing.
- Added TestParseLine_Logfmt to verify logfmt parsing.
- Added TestParseLine_KeyValue to verify key-value parsing.
- Added TestParseLine_PrefixFallback to verify prefix-based parsing and fallback.
- Added TestParseLine_PlainTextFallback to verify plain text fallback.
- Added TestParseLine_OrderedParsers to confirm parser precedence rules.
- Included helper functions assertAttr and assertTimestamp for test assertions.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

The pull request introduces an ordered event parser pipeline, which is a well-structured approach to handling various log formats. The implementation covers JSON, logfmt, key-value, prefix extraction, and plain-text fallback, demonstrating a comprehensive parsing strategy. The included tests are thorough and effectively validate the precedence and fallback behaviors of the parsers. Overall, the changes are well-implemented and contribute positively to the codebase.

gemini-code-assist · 2026-03-11T16:05:07Z

pkg/event/parser.go

+	timestampPrefixWithLevelPattern = regexp.MustCompile(`^\s*([0-9]{4}-[0-9]{2}-[0-9]{2}(?:[T ][0-9]{2}:[0-9]{2}:[0-9]{2}(?:\.[0-9]+)?(?:Z|[+-][0-9]{2}:[0-9]{2})?))\s+\[?([A-Za-z]+)\]?:?\b`)
+	timestampPrefixPattern          = regexp.MustCompile(`^\s*([0-9]{4}-[0-9]{2}-[0-9]{2}(?:[T ][0-9]{2}:[0-9]{2}:[0-9]{2}(?:\.[0-9]+)?(?:Z|[+-][0-9]{2}:[0-9]{2})?))\b`)
+	levelPrefixPattern              = regexp.MustCompile(`^\s*\[?([A-Za-z]+)\]?:?\b`)


The timestamp pattern is quite complex and is repeated in timestampPrefixWithLevelPattern and timestampPrefixPattern. Extracting this common sub-pattern into a separate named constant or a helper function could significantly improve readability and maintainability of these regular expressions. This would make it easier to understand and modify the timestamp matching logic in the future.

var ( timestampPattern = `[0-9]{4}-[0-9]{2}-[0-9]{2}(?:[T ][0-9]{2}:[0-9]{2}:[0-9]{2}(?:\.[0-9]+)?(?:Z|[+-][0-9]{2}:[0-9]{2})?)` timestampPrefixWithLevelPattern = regexp.MustCompile(`^\s*(` + timestampPattern + `)\s+\[?([A-Za-z]+)\]?:?\b`) timestampPrefixPattern = regexp.MustCompile(`^\s*(` + timestampPattern + `)\b`) levelPrefixPattern = regexp.MustCompile(`^\s*\[?([A-Za-z]+)\]?:?\b`) )

gemini-code-assist · 2026-03-11T16:05:07Z

pkg/event/parser.go

+		SourceFormat: SourceFormatPlainText,
+		Event:        event,
+	}, true


When prefixLineParser successfully extracts a timestamp or level, it still categorizes the SourceFormat as SourceFormatPlainText. While this parser acts as a fallback before the ultimate plainTextLineParser, labeling it as PlainText might not fully reflect that structured information (timestamp, level) was successfully extracted. Consider introducing a more specific SourceFormatPrefix to better distinguish lines where prefixes were parsed from truly unstructured plain text lines. This would improve the granularity of source format identification.

Implement ordered event parser pipeline

7152478

gemini-code-assist bot reviewed Mar 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BOR-519: Implement ordered event parser pipeline#28

BOR-519: Implement ordered event parser pipeline#28
bdclaw2026 wants to merge 1 commit intoSTRRL:masterfrom
bdclaw2026:bdclaw/bor-519-phase-3parser-pipeline-implement-ordered-parsers-and

bdclaw2026 commented Mar 11, 2026

Uh oh!

gemini-code-assist bot commented Mar 11, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Mar 11, 2026

Uh oh!

gemini-code-assist bot Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bdclaw2026 commented Mar 11, 2026

Summary

Validation

Uh oh!

gemini-code-assist bot commented Mar 11, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant