
Pull requests: mosaicml/streaming

Pull requests list

remove v0.7.0 warning
#788 opened Sep 23, 2024 by eitanturok
3 of 8 tasks
Update huggingface-hub requirement from <0.25,>=0.23.4 to >=0.23.4,<0.26 (label: dependencies)
#787 opened Sep 23, 2024 by dependabot bot
Refactor spanner to avoid creating large array
#773 opened Sep 3, 2024 by XiaohanZhangCMU
8 tasks done
Bump databricks-sdk from 0.29.0 to 0.30.0 (label: dependencies)
#761 opened Aug 19, 2024 by dependabot bot
Check file size within LocalUploader
#751 opened Aug 13, 2024 by XiaohanZhangCMU
8 tasks
Heterogeneous
#684 opened May 24, 2024 by XiaohanZhangCMU Draft
8 tasks
Update google-cloud-storage requirement from <2.11.0,>=2.9.0 to >=2.9.0,<2.17.0 (label: dependencies)
#641 opened Mar 25, 2024 by dependabot bot
parallel merge index
#590 opened Feb 5, 2024 by XiaohanZhangCMU
8 tasks
Add varint to MDS
#574 opened Jan 23, 2024 by knighton
Add options to precompute the epoch
#569 opened Jan 20, 2024 by knighton
Nuke 1) torch dist, 2) shared memory, and 3) filelock
#556 opened Dec 30, 2023 by knighton
Add fine-grained timings to Writers
#555 opened Dec 30, 2023 by knighton
Let's blow away dist, and also shared memory
#552 opened Dec 26, 2023 by knighton Draft
2 of 3 tasks
Parquet streaming [WIP]
#538 opened Dec 15, 2023 by knighton
"Golden spike" PR
#488 opened Oct 28, 2023 by knighton Draft
Hf ingestion
#483 opened Oct 23, 2023 by XiaohanZhangCMU
8 tasks
Modify dataframe_to_mds to accept streaming DF
#478 opened Oct 20, 2023 by maddiedawson
8 tasks
Training on PQ shards
#443 opened Sep 22, 2023 by knighton
8 tasks
tag shared and temp files with username
#430 opened Sep 11, 2023 by acutkosky
3 of 8 tasks
Parallelize StreamingDataset index downloads.
#285 opened Jun 2, 2023 by knighton
8 tasks
Shared lock
#250 opened Apr 29, 2023 by knighton
8 tasks