这是indexloc提供的服务,不要输入任何密码
Skip to content

[Bug]: _metadata_big_query_commit_timestamp is not populated when useStorageWriteApi=true in Spanner to BigQuery template #2341

@ruslan-yuldashev-sxp

Description

@ruslan-yuldashev-sxp

Related Template(s)

Spanner_Change_Streams_to_BigQuery

Template Version

Latest version from Google Cloud Console (Flex Template)

What happened?

I'm using the Cloud Spanner change streams to BigQuery template as described in the documentation:
https://cloud.google.com/dataflow/docs/guides/templates/provided/cloud-spanner-change-streams-to-bigquery

My pipeline is configured with --useStorageWriteApi=true.

The documentation mentions the following:

_metadata_big_query_commit_timestamp: The commit timestamp when the row is inserted into BigQuery. If useStorageWriteApi is true, this column is not automatically created in the changelog table by the pipeline. In that case, you must manually add this column in the changelog table if needed.

  • I manually added the _metadata_big_query_commit_timestamp column to the changelog table schema (as a TIMESTAMP).
  • The pipeline is running correctly and writing rows to BigQuery.
  • However, the _metadata_big_query_commit_timestamp field is never populated — it remains NULL in all rows.

🔍 Question:

Is this field supposed to be automatically populated by the template or by BigQuery Storage Write API when useStorageWriteApi=true?

If not — what is the recommended way to populate this value?

Thank you!

Relevant log output

Metadata

Metadata

Labels

bugSomething isn't workingp2

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions