/envs/py2env/lib/python2.7/site-packages/apache_beam/runners/dataflow/internal/dependency.pyc in _stage_beam_sdk_tarball/usr/local/envs/py2env/lib/python2.7/site-packages/<em
'write' >> beam.io.WriteToText(output_gcs)p.run().wait_until_finish()File "/usr/local/lib/python2.7/dist-packages/apache_beam/internal/pickler.py", line 212, in loads File "&
我想读取一个GZIP压缩的帕奎特文件从GCS到BigQuery使用PythonSDK for Apache光束。但是,apache_beam.io.parquetio.ReadFromParquet方法似乎不支持从压缩文件中读取。根据源代码,压缩类型被硬编码为UNCOMPRESSED。有没有一个技巧来读取压缩的拼图文件,而不需要在GCS中预先解压缩文件?如果这是唯一的方法,有没有办法在GCS中直接解压缩文件?
/apache_beam/runners/worker/sdk_worker.py", line 134, in _executeFile "/usr/local/lib/python2.7/dist-packages/apache_beam/runners/worker/sdk_worker.py",