ShypdShypd.ai
Data & AnalyticsData Cleaning & PrepData Pipelines & IntegrationAI Frameworks & InfraOpen Source & ModelsFree

data-juicer

github.com

Data-Juicer is an open-source data processing system designed for foundation models. It offers modular building blocks for cleaning, synthesizing, and...

0
Views

Boost your confidence score by at least 15%

Page created: Mar 2, 2026·Last updated by Shypd: Mar 2, 2026

SHYPD CONFIDENCE SCORE

Likely Legit

PRICING

ModelFree

Explore more AI tools for Data Cleaning & Prep

Unstructured

Unstructured

74%

Unstructured is a data transformation platform that converts unstructured data into AI-ready JSON files. It captures data from various sources and processes over 64 file types. The platform is designed for organizations looking to integrate AI into their business by making previously inaccessible data usable. It is trusted by a large percentage of Fortune 1000 companies.

Simba Technologies

Simba Technologies

72%

Simba Technologies helps impact organizations modernize operations using WhatsApp and AI tools. It facilitates data collection from the field, providing simple, actionable, and scalable impact measurement. The platform supports multimedia responses and feedback surveys in 180+ countries and 100+ languages. Simba aims to align mission, data, and technology for better outcomes.

datumaro

datumaro

71%

Datumaro is a dataset management framework for computer vision. It is a Python library and CLI tool used to build, analyze, and manage computer vision datasets. The tool helps in transforming and analyzing datasets, providing a comprehensive solution for dataset handling.

Tabular-data-generation

Tabular-data-generation

71%

Tabular-data-generation is a repository that explores the use of Generative Adversarial Networks (GANs) for generating tabular data. It reviews recent papers on tabular GANs and examines their application in creating realistic synthetic data. The repository also covers TimeGANs and diffusion models for tabular data generation.

towhee

towhee

71%

Towhee is a framework designed to simplify and accelerate neural data processing pipelines. It focuses on processing unstructured data using Large Language Model (LLM) based pipeline orchestration. The tool helps extract insights from various unstructured data types. It streamlines the development of efficient data pipelines.

dataset-generator

dataset-generator

71%

Dataset-generator is an open-source tool for creating realistic datasets. It allows users to generate datasets for demos, learning, and dashboards. The tool provides real-time data previews and supports exporting data as CSV or SQL. It is useful for instantly creating and exploring data.

batchgenerators

batchgenerators

71%

batchgenerators is a framework for data augmentation in 2D and 3D image classification and segmentation tasks. It is an open-source tool available on GitHub. It allows researchers and developers to enhance their datasets for improved model training.

data-validation

data-validation

71%

TensorFlow Data Validation (TFDV) is a library for exploring and validating machine learning data. It is designed to be scalable and works with TensorFlow and TensorFlow Extended (TFX). TFDV helps ensure data quality and identify anomalies in machine learning datasets.

Curator

Curator

71%

Curator is a scalable data preprocessing and curation toolkit for LLMs. It is GPU-accelerated for faster processing. The tool supports modular pipelines for text, images, video, and audio data. Curator is part of the NVIDIA NeMo software suite.

Explore more Data & Analytics tools

Busel.ai

Busel.ai

80%

Busel.ai is an AI-powered tool that connects to Stripe, Google Analytics, and Search Console to provide SaaS businesses with a comprehensive overview of their metrics. It delivers daily email reports with key metrics, AI-driven insights, and curated readings. The platform helps users spot patterns, identify problems early, and make informed decisions.

Bet.AI: Betting Assistant

Bet.AI: Betting Assistant

79%

Bet.AI is an innovative AI-powered mobile application designed to provide sports bettors with a significant analytical edge. It functions as a personal AI betting analyst, transforming snapshots or screenshots of bet slips, game screens, or matchups into instant, comprehensive Betting Intelligence Reports. These reports are generated by analyzing over 1,000 signals, offering deep market intelligence, advanced team and player analytics, and uncovering hidden X-factors that traditional bookmakers might overlook. Key insights include odds shopping, fair value assessment, vig analysis, expected value (EV+), arbitrage opportunities, and consensus lines. The platform also delves into advanced team and player statistics, matchup form, injury reports, fatigue levels, and momentum trends. Furthermore, Bet.AI identifies situational contexts such as referee patterns, weather conditions, and travel fatigue, providing a holistic view beyond mere odds. By offering a real-time, expert-level read on any matchup in seconds, Bet.AI empowers both casual fans and serious bettors to make data-driven decisions, understand the underlying drivers of odds, and potentially improve their betting strategies.

ScoutAI: AI Football Forecast

ScoutAI: AI Football Forecast

79%

ScoutAI: AI Football Forecast represents a next-generation football analytics platform, meticulously engineered to provide unparalleled football intelligence through advanced Artificial Intelligence. This sophisticated tool is designed for a diverse audience, including passionate football fans, dedicated fantasy football players, and data-driven sports analysts. It goes beyond conventional predictions by processing thousands of intricate data points, encompassing historical team form, advanced individual performance metrics, head-to-head records, and complex tactical matchups, to deliver exceptionally reliable and in-depth football insights. ScoutAI distinguishes itself by generating high-quality, AI-driven match commentary, comprehensive pre-match summaries, and precise outcome expectations, all accompanied by clear, logical explanations. Furthermore, it offers advanced metrics such as Expected Goals (xG), effectively bridging the gap between cutting-edge data science and the nuanced beauty of the game, empowering users with a deeper, more analytical understanding of football.

Screen Url

Screen Url

78%

ScreenURL is a screenshot API for developers to capture website screenshots with a single API call. It provides pixel-perfect screenshots in milliseconds. It is suitable for social media previews, automated testing, website monitoring, and content aggregation. A free tier is available.

GradingMetric

GradingMetric

78%

GradingMetric is an AI-powered tool for trading card grading analysis. It predicts grades across PSA, BGS, CGC, and SGC. The tool provides Capital Score ROI metrics and defect detection. It also offers submit or hold recommendations for trading cards.

Baseline Core

Baseline Core

78%

Baseline Core is an open-source skills system designed for AI agents. It enables AI tools to perform tasks like market research, PRD writing, and sprint planning, grounded in specific business contexts. The system includes skills, frameworks, and reference files. It is compatible with tools like Claude Code, ChatGPT, and GitHub Copilot.

Browse Data & Analytics