You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Following the release of PrismRunner for Python SDK with Apache Beam 2.59.0, we are trying to adapt our code so that we can use the same code in GCP as in Local debugging (DirectRunner lacks many features). The only difference we have in the code is that in Dataflow we listen from Pub/Sub whereas in Local we use test json local files that are read as Python dictionaries and then they are instantiated in a PCollection with beam.Create.
The problem we are facing is that this feature (beam.Create) is not implemented in PrismRunner yet and gives us the following error:
INFO:apache_beam.utils.subprocess_server:2024/10/10 12:21:11 ERROR unable to run job cause="unimplemented features" jobname=job errors="unsupported feature \"PTransform.Spec.Urn\" set with value beam:transform:pickled_python:v1 Create/MaybeReshuffle"
INFO:apache_beam.utils.subprocess_server:2024/10/10 12:21:11 ERROR job failed job.key=job-001 job.name=job error="found 1 uses of features unimplemented in prism in job job:\nunsupported feature \"PTransform.Spec.Urn\" set with value beam:transform:pickled_python:v1 Create/MaybeReshuffle"
We are opening this issue since the lack of implementation of this feature is not documented in the list of missing features and we want to make sure it does not slip out of the roadmap since it is a basic transformation for local development in order not depend from complex I/O resources:
In the [2.59.0 release](https://beam.apache.org/blog/beam-2.59.0/), Prism passes most runner validations tests with the exceptions of pipelines using the following features:
OrderedListState, OnWindowExpiry (eg. GroupIntoBatches), CustomWindows, MergingWindowFns, Trigger and WindowingStrategy associated features, Bundle Finalization, Looping Timers, and some Coder related issues such as with Python combiner packing, and Java Schema transforms, and heterogenous flatten coders. Processing Time timers do not yet have real time support.
Thank you!
Issue Priority
Priority: 2 (default / most bugs should be filed as P2)
Issue Components
Component: Python SDK
Component: Java SDK
Component: Go SDK
Component: Typescript SDK
Component: IO connector
Component: Beam YAML
Component: Beam examples
Component: Beam playground
Component: Beam katas
Component: Website
Component: Infrastructure
Component: Spark Runner
Component: Flink Runner
Component: Samza Runner
Component: Twister2 Runner
Component: Hazelcast Jet Runner
Component: Google Cloud Dataflow Runner
The text was updated successfully, but these errors were encountered:
What happened?
Following the release of PrismRunner for Python SDK with Apache Beam 2.59.0, we are trying to adapt our code so that we can use the same code in GCP as in Local debugging (DirectRunner lacks many features). The only difference we have in the code is that in Dataflow we listen from Pub/Sub whereas in Local we use test json local files that are read as Python dictionaries and then they are instantiated in a PCollection with
beam.Create
.The problem we are facing is that this feature (
beam.Create
) is not implemented in PrismRunner yet and gives us the following error:We are opening this issue since the lack of implementation of this feature is not documented in the list of missing features and we want to make sure it does not slip out of the roadmap since it is a basic transformation for local development in order not depend from complex I/O resources:
Thank you!
Issue Priority
Priority: 2 (default / most bugs should be filed as P2)
Issue Components
The text was updated successfully, but these errors were encountered: