ShypdShypd.ai
Data & AnalyticsWeb Scraping & ExtractionMarket ResearchStrategy & PlanningBrowser & Web AgentsSafety & SecurityFree

taranis-ai

Visit taranis-ai

Taranis AI is an open-source intelligence (OSINT) tool that uses AI to gather information and perform situational analysis. It navigates diverse data sources...

0
Views

Boost your confidence score by at least 15%

Page created: Mar 2, 2026·Last updated by Shypd: Mar 2, 2026

SHYPD CONFIDENCE SCORE

Likely Legit

PRICING

ModelFree

CHECK OTHER WEB SCRAPING & EXTRACTION AI TOOLS

Reworkd

Reworkd

72%

Reworkd is an AI-powered platform that optimizes web data extraction. It generates and repairs scraping code, adapting to website changes automatically. Reworkd's no-code interface allows companies to scale their web data extraction without building individual scraping bots. It offers a community-driven initiative for AI democratization.

PDFAnnotations

PDFAnnotations

72%

PDFAnnotations is a tool for turning PDF highlights into structured notes. It is a privacy-first, local-browser tool designed for creating a second brain. The tool allows one-click export to Notion, Obsidian, and Markdown with clean formatting. It also features smart filters to sort and export by color or annotation type.

Kadoa

Kadoa

72%

Kadoa is an AI-powered, no-code platform for web data extraction. It allows users to scrape web data, monitor changes, and integrate insights into workflows. Kadoa supports smart navigation and autonomous operation for accurate and up-to-date data collection. It caters to various use cases, including e-commerce and AI training.

deep-text-recognition-benchmark

deep-text-recognition-benchmark

71%

Deep-text-recognition-benchmark is a PyTorch implementation for text recognition using deep learning methods. It provides a four-stage STR framework suitable for most existing STR models. The tool allows for module-wise contributions to performance in terms of accuracy. It includes training and evaluation data, failure cases, and cleansed labels.

trafilatura

trafilatura

71%

Trafilatura is a Python and command-line tool designed for gathering text and metadata from the web. It facilitates crawling, scraping, and extraction of data. The tool supports output in various formats, including CSV, JSON, HTML, MD, TXT, and XML. It is useful for researchers and developers needing to process web content.

yake

yake

71%

YAKE is an unsupervised automatic keyword extraction method for single documents. It uses text statistical features to select important keywords. YAKE requires no training and is a lightweight solution. It can be used for text summarization and content tagging.

View all Web Scraping & Extraction tools →