MosaicML StreamingDataset: Fast, Accurate Streaming of Training Data from Cloud Storage
Loading your training data becomes an escalating challenge as datasets grow bigger in size and the number of nodes scales. We built StreamingDataset to make training on large datasets from cloud storage as fast, cheap, and scalable as possible. Specially designed for multi-node, distributed training, StreamingDataset maximizes correctness guarantees, performance, and ease of use.
Comments
There aren't any comments yet. Be the first to comment!