Docta
Visit ToolDocta is an open-source data-centric AI platform that diagnoses and rectifies issues in your data. It helps improve model performance by curing unhealthy data, supporting tabular, text, and image data.
At a glance
Trending
Docta is an open-source data-centric AI platform that diagnoses and rectifies issues in your data. It helps improve model performance by curing unhealthy data, supporting tabular, text, and image data.
Trending
About
Docta is an advanced open-source data-centric AI platform designed to detect and rectify issues within various data types, including tabular, text, and image data, as well as pre-trained model embeddings. It aims to improve model performance by ensuring data health through diagnosis, curation, and nutrition services. The tool is training-free, making it a premium-free option that operates on user data without additional prerequisites. Docta can identify label errors, as demonstrated with LLM alignment data (e.g., Anthropic's HH-RLHF dataset) and real-world human-annotated image data like CIFAR-N. It also excels at detecting rare patterns in datasets, which can be crucial for enhancing data quality and model robustness. The platform provides diagnosis reports and suggests corrections, such as improved ratings for LLM responses.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending