Petastorm
Visit ToolPetastorm is an open-source data access library that enables single-machine or distributed training and evaluation of deep learning models directly from datasets in Apache Parquet format. It supports popular ML frameworks like Tensorflow, PyTorch, and PySpark.
At a glance
Trending