[confgenerator] Support `LoggingProcessorParseMultilineRegex` in Otel Logging. #2103

franciscovalentecastro · 2025-10-24T18:24:08Z

Description

Implement LoggingProcessorParseMultilineRegex and LoggingProcessorParseRegexComplex in Otel Logging.

Details

Fixed saphana receiver incorrect use of type "int" which should be "integer".
Created logging-otel-receiver_kafka confgenerator test to validate resulting config.
Updated a lot of relevant transformation test goldens.

Related issue

b/440599473

How has this been tested?

Checklist:

Unit tests
- Unit tests do not apply.
- Unit tests have been added/modified and passed for this PR.
Integration tests
- Integration tests do not apply.
- Integration tests have been added/modified and passed for this PR.
Documentation
- This PR introduces no user visible changes.
- This PR introduces user visible changes and the corresponding documentation change has been made.
Minor version bump
- This PR introduces no new features.
- This PR introduces new features, and there is a separate PR to bump the minor version since the last release already.
- This PR bumps the version.

jefferbrecht · 2025-11-13T19:02:20Z

confgenerator/logging_processors.go

+
+	var exprParts []string
+	for _, r := range isFirstEntry {
+		exprParts = append(exprParts, fmt.Sprintf("body.message matches %q", r))


Why not build exprParts directly in the first loop and eliminate isFirstEntry altogether?

No real reason. In the other parse_multiline PR, we have a more complicated expressions setup, so it made more sense to do it in steps. I simplified it. Done!

jefferbrecht · 2025-11-13T19:13:07Z

confgenerator/logging_processors.go

+		// TODO: b/459877163 - Update implementation when opentelemetry supports "state-machine" multiline parsing.
+		if r.StateName == "start_state" {
+			isFirstEntry = append(isFirstEntry, r.Regex)
+		}


I assume that by ignoring some states, there will be some possible log inputs that won't parse properly as multiline. (If that's not true, then we should be able to refactor the receivers to only have start_state.)

How do you want to approach testing for that gap? E.g. will you add/change transformation tests later along with b/459877163 to validate whichever edge cases wouldn't work today without a full state machine?

All the current uses of LoggingProcessorParseMultilineRegex ¹ only set two states start_state and cont_state in a simplified manner such that start_state = FirstLogLineRegex and cont_state = Negation of "FirstLogLineRegex" (note : i've just double checked ¹, also git grep -C 10 "start_state" helps).

ops-agent/apps/solr.go

Lines 84 to 94 in ccfedc9

Rules: []confgenerator.MultilineRule{

{

StateName: "start_state",

NextState: "cont",

Regex: `^\d{4}-\d{2}-\d{2}\s\d{2}:\d{2}:\d{2}\.\d{3}\s[A-z]+\s{1,5}`,

},

{

StateName: "cont",

NextState: "cont",

Regex: `^(?!\d{4}-\d{2}-\d{2}\s\d{2}:\d{2}:\d{2}\.\d{3}\s[A-z]+\s{1,5})`,

},

This implies all current uses of LoggingProcessorParseMultilineRegex can be fully replicated by only setting "is_first_entry" in otel logging.

Proposed Refactor

We could refactor LoggingProcessorParseMultilineRegex to only be able to set a start_state and then set cont_state programatically as the "negation" of the "start_state". This will enforce this simplified use of multiline features.

How do you want to approach testing for that gap? E.g. will you add/change transformation tests later along with b/459877163 to validate whichever edge cases wouldn't work today without a full state machine?

Re @jefferbrecht

It depends. What do you think of the Proposed Refactor ?

If we refactor LoggingProcessorParseMultilineRegex to only set a start_state, then there won't be any feature gaps and the 3P app receiver tests are good enough for this.

If we don't refactor it, creating a "transformation_test" would be artificial since we would need to create a "test processor" that uses all the feature of the "state-machine". A "transformation_test" needs a "registered" processor to be able to add to a pipeline.

Footnotes

https://github.com/search?q=repo%3AGoogleCloudPlatform%2Fops-agent%20LoggingProcessorParseMultilineRegex&type=code ↩ ↩²

NVM, not "all" 3P app receiver implementations use the simplified set of start_state and cont_state. Though it's only mysql_slow and elasticsearch_json the ones that use a more complicated (not too much) set of regexes.

How do you want to approach testing for that gap? E.g. will you add/change transformation tests later along with b/459877163 to validate whichever edge cases wouldn't work today without a full state machine?

The tests of mysql_slow and elasticsearch_json can serve to compare with the use of all "state-machine" like features.

See draft refactor : dc991f6#diff-7def08d2dee0c2606af18bb82d03c649a7ece83d1fb913fcfb800c6558eb942e

…ng support.

…essed.

franciscovalentecastro force-pushed the fcovalente-parse-multiline-regex branch from 3f4322a to 68b4a04 Compare November 11, 2025 18:54

franciscovalentecastro requested review from a team, hsmatulis and ridwanmsharif and removed request for a team and hsmatulis November 11, 2025 22:32

franciscovalentecastro force-pushed the fcovalente-parse-multiline-regex branch from 367d8fa to 6ac6e16 Compare November 13, 2025 01:51

franciscovalentecastro requested review from a team, avilevy18, jefferbrecht and quentinmit and removed request for a team, avilevy18 and ridwanmsharif November 13, 2025 16:30

franciscovalentecastro added the kokoro:force-run Forces kokoro to run integration tests on a CL label Nov 13, 2025

stackdriver-instrumentation-release removed the kokoro:force-run Forces kokoro to run integration tests on a CL label Nov 13, 2025

jefferbrecht reviewed Nov 13, 2025

View reviewed changes

franciscovalentecastro added 8 commits November 13, 2025 19:35

Add LoggingProcessorParseMultilineRegex support.

b63632e

Fix saphana types.

25563ad

Update transformation test goldens.

4fe6afb

Update transformation tests goldens.

e7d1252

Update confgenerator goldens.

1c6e766

Cleanup implementation.

17094dd

Set poll_interval and force_flush_period to default values.

5ae4365

Update transformation test goldens.

aa90389

franciscovalentecastro added 9 commits November 13, 2025 19:35

Improve implementation and add comments about current multiline parsi…

469dd29

…ng support.

Improve comment.

2fa173c

Simplify implementation.

eaf89ab

Add logging-otel-receiver_kafka confgenerator test.

edf5cff

Update force_flush_period to match fluent-bit "flush_timeout".

b03b7f1

Update consumingCount check to be sure the whole input file is proc…

545db20

…essed.

Update transformation_test goldens.

04b5180

Update confgenerator goldens.

ccd2ddc

Simplify isFirstEntryExpr calculation.

67d3e0e

franciscovalentecastro force-pushed the fcovalente-parse-multiline-regex branch from 6ac6e16 to 67d3e0e Compare November 13, 2025 19:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[confgenerator] Support `LoggingProcessorParseMultilineRegex` in Otel Logging. #2103

[confgenerator] Support `LoggingProcessorParseMultilineRegex` in Otel Logging. #2103

Uh oh!

franciscovalentecastro commented Oct 24, 2025 •

edited

Loading

Uh oh!

jefferbrecht Nov 13, 2025

Uh oh!

franciscovalentecastro Nov 13, 2025 •

edited

Loading

Uh oh!

jefferbrecht Nov 13, 2025

Uh oh!

franciscovalentecastro Nov 13, 2025

Uh oh!

franciscovalentecastro Nov 13, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	Rules: []confgenerator.MultilineRule{
	{
	StateName: "start_state",
	NextState: "cont",
	Regex: `^\d{4}-\d{2}-\d{2}\s\d{2}:\d{2}:\d{2}\.\d{3}\s[A-z]+\s{1,5}`,
	},
	{
	StateName: "cont",
	NextState: "cont",
	Regex: `^(?!\d{4}-\d{2}-\d{2}\s\d{2}:\d{2}:\d{2}\.\d{3}\s[A-z]+\s{1,5})`,
	},

[confgenerator] Support LoggingProcessorParseMultilineRegex in Otel Logging. #2103

Are you sure you want to change the base?

[confgenerator] Support LoggingProcessorParseMultilineRegex in Otel Logging. #2103

Uh oh!

Conversation

franciscovalentecastro commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Details

Related issue

How has this been tested?

Checklist:

Uh oh!

jefferbrecht Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

franciscovalentecastro Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jefferbrecht Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

franciscovalentecastro Nov 13, 2025

Choose a reason for hiding this comment

Proposed Refactor

Footnotes

Uh oh!

franciscovalentecastro Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[confgenerator] Support `LoggingProcessorParseMultilineRegex` in Otel Logging. #2103

[confgenerator] Support `LoggingProcessorParseMultilineRegex` in Otel Logging. #2103

franciscovalentecastro commented Oct 24, 2025 •

edited

Loading

franciscovalentecastro Nov 13, 2025 •

edited

Loading

franciscovalentecastro Nov 13, 2025 •

edited

Loading