Streaming read/writes to Google Storage blobs with ascynchronous buffering.
Project description
gs-chunked-io: Streams for Google Storage
gs-chunked-io provides transparently chunked io streams for google storage objects. Writable streams are managed as multipart objects that are composed when the stream is closed.
import gs_chunked_io as gscio
from google.cloud.storage import Client
client = Client()
bucket = client.bucket("my-bucket")
blob = bucket.get_blob("my-key)
# Readable stream:
with gscio.Reader(blob) as fh:
fh.read(size)
# Readable stream, download in background:
with gscio.AsyncReader(blob) as fh:
fh.read(size)
# Writable stream:
with gscio.Writer("my_new_key", bucket) as fh:
fh.write(data)
# Writable stream, upload in background:
with gscio.AsyncWriter("my_new_key", bucket) as fh:
fh.write(data)
# Process blob in chunks:
with gscio.Reader(blob) as reader:
for chunk in reader.for_each_chunk():
my_chunk_processor(chunk)
# Multipart copy with processing:
dst_bucket = client.bucket("my_dest_bucket")
with gscio.Writer("my_dest_key", dst_bucket) fh_write:
with gscio.AsyncReader(blob) as reader:
for chunk in reader.for_each_chunk(blob):
process_my_chunk(chunk)
fh_write(chunk)
# Extract .tar.gz on the fly:
import gzip
import tarfile
with gscio.AsyncReader(blob) as fh:
gzip_reader = gzip.GzipFile(fileobj=fh)
tf = tarfile.TarFile(fileobj=gzip_reader)
for tarinfo in tf:
process_my_tarinfo(tarinfo)
Installation
pip install gs-chunked-io
Links
Project home page GitHub
Package distribution PyPI
Bugs
Please report bugs, issues, feature requests, etc. on GitHub.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
gs-chunked-io-0.2.5.tar.gz
(4.6 kB
view hashes)