NVTabular
Visit ToolNVTabular is a feature engineering and preprocessing library for tabular data that manipulates terabyte-scale datasets. It is designed to train deep learning-based recommender systems quickly and easily.
At a glance
Trending
NVTabular is a feature engineering and preprocessing library for tabular data that manipulates terabyte-scale datasets. It is designed to train deep learning-based recommender systems quickly and easily.
Trending
About
NVTabular is a powerful feature engineering and preprocessing library specifically designed for tabular data, enabling the manipulation of terabyte-scale datasets. It accelerates computation on the GPU using the RAPIDS Dask-cuDF library, making it ideal for training deep learning-based recommender systems. As a core component of NVIDIA Merlin, it seamlessly integrates with other Merlin tools like Merlin Models, HugeCTR, and Merlin Systems to provide end-to-end acceleration for recommender systems on the GPU. NVTabular addresses challenges such as processing huge datasets, managing complex data pipelines, and overcoming input bottlenecks, allowing data scientists and ML engineers to focus on data transformation rather than scaling issues. It significantly reduces the time required for feature engineering and preprocessing, with reported completion times of 13 minutes on a single V100 GPU and 3 minutes on a DGX-1 cluster for the Criteo 1TB Click Logs Dataset.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending