🤖

AI Agents & Automation

Browsing page 44 of RAG & Document AI in AI Agents & Automation. Sorted by confidence score — our independent quality rating.

All AI Frameworks & Infra Browser & Web Agents Chatbots & Conversational AI General-Purpose Agents Multi-Agent Systems Personal Assistants RAG & Document AI RPA Scheduling & Task Agents Voice Agents Workflow Agents

bert-extractive-summarizer

60%

bert-extractive-summarizer is an open-source Python library designed for extractive text summarization, building upon the HuggingFace Pytorch transformers library. The tool operates by first embedding sentences from the input text and then employing a clustering algorithm to identify and extract sentences closest to the cluster centroids, forming a concise summary. It also incorporates coreference resolution techniques, utilizing the neuralcoref library, to enhance the coherence and context of the generated summaries. Users can customize various parameters, including the number of sentences or ratio for the summary, and integrate custom models or Sentence-BERT for diverse summarization needs. The library supports GPU acceleration via CUDA by default if available, and offers a Flask service with Docker support for easy deployment.

BERT-NER

60%

BERT-NER is an open-source tool leveraging Google's BERT model for named entity recognition (NER), specifically fine-tuned on the CoNLL-2003 dataset. This updated version addresses shortcomings of the original by providing clearer annotations and improved data preprocessing and layer design, making it easier for developers to implement and modify. Users can experiment with different layer designs, such as CRF or Softmax, to optimize performance. The repository includes all necessary files, such as BERT model components, data directories, and evaluation scripts, along with detailed instructions for usage. It offers strong performance metrics on the CoNLL-2003 test set, including high accuracy, precision, recall, and F1 scores for various entity types like LOC, MISC, ORG, and PER.

contextgem

60%

ContextGem is a free, open-source LLM framework designed to radically simplify the extraction of structured data and insights from various documents. It eliminates extensive boilerplate code often required by other frameworks, significantly reducing development time and complexity. Key features include automated dynamic prompts, data modeling and validators, precise granular reference mapping, and multilingual support. ContextGem allows users to extract structured data, identify key aspects, and build complex extraction workflows through an intuitive API. It supports both cloud-based and local LLMs via LiteLLM integration and offers optimizations for accuracy, speed, and cost, making it ideal for in-depth single-document analysis.

deep-research-web-ui

60%

deep-research-web-ui is an AI-powered research assistant designed for iterative, deep research across various topics. It integrates search engines, web scraping, and large language models to provide comprehensive insights. Key features include real-time AI response streaming, a tree-structure visualization of the research process, and support for multiple languages. The tool ensures safety and security by processing all configurations and API requests locally in the browser. It also allows for exporting final research reports as Markdown or PDF and supports a wide range of AI providers like OpenAI compatible, DeepSeek, and Ollama, as well as web search providers like Tavily and Firecrawl. It can be deployed in a server mode with environment variables or client mode where users configure their own API keys.

PDF RAG AI

60%

PDF RAG AI offers an interactive AI assistant designed to help users extract information, answer questions, and perform various tasks. By typing in queries, users can receive helpful responses from the AI. This tool is particularly useful for interacting with PDF documents, enabling efficient information retrieval and understanding of content. Hosted on Hugging Face Spaces, it leverages advanced AI capabilities to provide a seamless conversational experience, making it easier to process and understand complex documents without manual effort. The platform aims to simplify data interaction and enhance productivity for users dealing with large volumes of information.

AND Solutions Pte. Ltd.,

60%

AND Solutions Pte. Ltd. offers a comprehensive suite of AI-powered software solutions designed for banks and financial institutions. Their platform integrates lending, document intelligence, and credit decisioning into a single connected system. Key products include Looms for end-to-end loan origination and management with automated workflows and flexible loan configurations, Mindox for intelligent document processing to reduce manual tasks and enhance data accuracy, and advanced credit scoring tools like Scorecard Builder and Custom AI Scoring for accurate risk assessment and automated approvals. The solutions are built to streamline operations, accelerate lending, and improve decision-making for financial institutions across Southeast Asia.

hedwig

60%

Hedwig is an open-source repository offering PyTorch deep learning models specifically designed for document classification tasks. Developed by the Data Systems Group at the University of Waterloo, it includes implementations of several prominent models such as DocBERT, Reg-LSTM, XML-CNN, HAN, Char-CNN, and Kim CNN. Each model directory contains a detailed README.md for further information. The project is designed for Python 3.6 and PyTorch 0.4, with clear instructions for environment setup using Anaconda and installation of dependencies. It also provides options for downloading necessary datasets like Reuters, AAPD, and IMDB, along with word2vec embeddings, making it a comprehensive resource for document classification research and application.

graphrag-local-ollama

60%

GraphRAG Local Ollama is an open-source adaptation of Microsoft's GraphRAG, designed to leverage local models via Ollama for LLM and embedding extraction. This tool eliminates the dependency on costly OpenAPI models, offering a cost-effective solution for knowledge graph implementations. It supports a variety of local models such as Llama3, Mistral, Gemma2, and Phi3, and integrates with Ollama for both language models and embedding models like nomic-embed-text. The setup process is straightforward, involving conda environment creation, Ollama installation, repository cloning, and specific `pip install` commands. Users can easily configure models and run indexing and querying operations, with options to visualize generated graphs using tools like Gephi or a provided Python script.

Cloud Contracts 365

60%

Cloud Contracts 365 is a powerful AI-powered contract management tool specifically designed for IT services, offering the precision of a lawyer in a fraction of the time. It enables users to create, review, and manage commercial contracts all in one place, helping to protect businesses, close deals faster, and significantly reduce legal fees. Key features include a Contract Builder for quick and simple contract creation with essential clauses for technology businesses, an AI-powered Contract Reviewer for precise analysis, risk scoring, and negotiation insights, and a Contract Manager for centralizing contracts, e-signatures, and automated renewal reminders. The platform is particularly beneficial for MSPs, ISVs, SaaS Providers, and Microsoft Partners.

jvector

60%

JVector is an advanced embedded vector search engine that tackles the challenges of exact nearest neighbor search in high-dimensional spaces, a problem known as the “curse of dimensionality.” It focuses on approximate nearest neighbor (ANN) search, offering a more efficient solution for large datasets. JVector is a graph-based index that combines the hierarchical structure of HNSW with the Vamana algorithm (from DiskANN) within each layer. Its architecture supports multi-layer graphs with nonblocking concurrency, allowing linear scaling with the number of cores. It also features a two-pass search design using lossily compressed representations for the first pass (PQ, BQ, Fused PQ) and more accurate representations for the second (Full resolution float32, NVQ), reducing memory usage and latency while preserving accuracy. JVector also uniquely allows for building larger-than-memory indexes using two-pass searches.

KAG

60%

KAG is an open-source logical form-guided reasoning and retrieval framework built upon the OpenSPG engine and large language models (LLMs). It specializes in creating logical reasoning and factual Q&A solutions for professional domain knowledge bases, effectively addressing the limitations of traditional RAG vector similarity calculations and GraphRAG noise. KAG supports logical reasoning and multi-hop factual Q&A, offering superior performance compared to current state-of-the-art methods. Its core features include knowledge and chunk mutual indexing, conceptual semantic reasoning for knowledge alignment, schema-constrained knowledge construction, and logical form-guided hybrid reasoning and retrieval.

instill-core

60%

Instill Core is a full-stack, open-source AI infrastructure tool designed for comprehensive data, model, and pipeline orchestration. It simplifies the complexities of building AI-first applications by offering ETL processing, AI-readiness, and capabilities for hosting open-source LLMs and RAG. The platform features a Pipeline builder for creating AI-first APIs and automated workflows, Components for connecting essential building blocks, and Artifact management to transform unstructured data into AI-ready formats. Instill Core also supports deploying and monitoring AI models without requiring extensive GPU infrastructure, making it accessible for various AI development needs. It provides client access via Console, CLI, and SDKs (Python, TypeScript).

kg-gen

60%

kg-gen is an AI tool designed for generating knowledge graphs from diverse text inputs. It can process both small and large texts, offering chunking capabilities for extensive documents, and effectively handles conversational messages while preserving role information and message order. The tool supports a wide range of API-based and local model providers through LiteLLM, including OpenAI, Ollama, Anthropic, and Gemini, and utilizes DSPy for structured output generation. Key features include clustering similar entities and relations, aggregating multiple graphs, and extracting relationships between concepts and speakers in conversations. It's ideal for creating graphs to assist with RAG, generating synthetic data, structuring text, and analyzing conceptual relationships.

CLEDAR

60%

CLEDAR offers an ontology-driven AI platform designed to transform fragmented enterprise data into actionable insights. Led by former CERN domain leaders, the platform unifies disparate data sources into a single, governed semantic context, laying the foundation for enterprise AI adoption. It features secure, modular infrastructure, a unified data foundation, and adaptive AI agents that automate workflows and execute end-to-end tasks autonomously. CLEDAR aims to boost productivity by cutting decision cycles from weeks to hours and optimize costs by reducing OPEX by up to 10%, helping companies scale AI from pilots to enterprise-wide impact.

PyABSA

60%

PyABSA is a modular and reproducible open-source framework designed for Aspect-based Sentiment Analysis (ABSA), bridging the gap from research to production. It offers a unified API for training, evaluation, and inference across multiple ABSA subtasks, including Aspect Polarity Classification (APC), Aspect Term Extraction & Polarity Classification (ATEPC), Aspect Sentiment Triplet Extraction (ASTE), and Aspect Category Opinion Sentiment Triplet Extraction (ASQP/ACOS). The framework comes with a Model Zoo of available checkpoints that auto-download, visualization tools for evaluation metrics, and helpers for dataset annotation. Additionally, PyABSA supports text augmentation for classification and adversarial defense, along with automatic device selection for CPU/GPU. It is ideal for researchers and developers working with sentiment analysis and natural language processing tasks.

Callidus Legal AI

60%

StrongSuit, previously known as Callidus Legal AI, is a comprehensive legal AI platform designed to enhance and speed up essential legal tasks for lawyers. It provides advanced AI legal research capabilities, including immediate answers to legal questions, analysis of complex fact patterns, and the ability to draft extensive memos and briefs. The platform also excels in contract redlining, allowing users to redline contracts significantly faster, summarize differences, compare against market standards, and generate AI-powered redline suggestions. Furthermore, StrongSuit assists with discovery and timelines, enabling the creation of timelines and statements of facts from relevant files, conducting document reviews, and improving writing. It aims to reduce hallucinations in legal research and offers a unified solution for various legal software needs.

rag-time

60%

RAG Time is a comprehensive 5-week learning journey designed to help users master Retrieval-Augmented Generation (RAG). Developed by Microsoft experts, this resource provides step-by-step guides, live coding samples, and expert insights to enable the creation of smarter AI applications. The program covers fundamental RAG concepts, building ultimate retrieval systems, optimizing vector indexes for scale, handling multimodal data, and exploring hero use cases, including Agentic RAG. It features exclusive video content, practical demonstrations, and sample code to facilitate hands-on learning, making complex concepts accessible through engaging visuals.

Search-R1

60%

Search-R1 is an open-source reinforcement learning framework designed for training large language models (LLMs) to effectively reason and make tool calls, specifically to search engines, in a coordinated manner. Built upon the veRL framework, it extends the concepts of DeepSeek-R1(-Zero) by integrating interleaved search engine access and offering a comprehensive RL training pipeline. This framework serves as an alternative to OpenAI DeepResearch, fostering research and development in tool-augmented LLM reasoning. It supports various RL methods like PPO, GRPO, and reinforce, accommodates different LLMs such as Llama3 and Qwen2.5, and integrates with diverse search engines including local sparse/dense retrievers and online search engines like Google and Bing.

semantra

60%

Semantra is a multipurpose command-line tool designed for semantic search across local documents, including text and PDF files. Unlike traditional keyword matching, Semantra allows users to query by meaning, providing a more intuitive and powerful search experience. It processes documents locally, launching a web search application for interactive querying. This tool is particularly useful for individuals needing to sift through large volumes of information, such as journalists analyzing leaked documents, researchers exploring academic papers, or students engaging with literature. Semantra prioritizes privacy and security by performing all analysis on the user's computer, and it offers configurable options for embedding models and search parameters.

File AI

60%

File AI is an AI-native data preparation and automation platform designed to unify data capture, governance, and orchestration into auditable AI workflows. It transforms unstructured data into trusted intelligence across various enterprise functions. The platform features fileForge, an AI-native data intelligence engine, alongside purpose-built solutions like fileLedger for financial operations automation and fileShield for intelligent case management in regulated environments. Key capabilities include multimodal AI OCR, classification, schema extraction, SOP-driven workflow engines, and over 100 ERP and system integrations. File AI aims to build the foundation for agentic AI at scale, providing the context, validation, and control needed for AI agents to act with confidence in real enterprise workflows.

Collate v1.7

60%

Collate is a privacy-first AI reader designed for Mac users, enabling them to chat with, summarize, and extract insights from PDF documents entirely offline. This local-first approach ensures that all processing runs directly on your device, guaranteeing complete privacy as your documents never leave your computer. It supports both Apple Silicon (M1, M2, M3) and Intel Macs running macOS 13.1 or later. Users can ask questions, get instant summaries, and receive citation-backed answers with automatic highlighting. Collate also supports multi-PDF chat for comparative research, folder organization, and the ability to export summaries and conversations in various formats like PDF, rich text, or email. It's completely free to download and use, with no subscription fees or usage limits.

sqlite-vss

60%

sqlite-vss is a SQLite extension designed to bring vector search capabilities directly into SQLite databases, leveraging the Faiss library for efficiency. It enables developers to build semantic search engines, recommendation systems, and question-and-answering tools by storing and querying vector embeddings. While not actively developed, with efforts now focused on sqlite-vec, it offers a robust solution for integrating vector search into applications using SQLite. Users can create virtual tables to store high-dimensional embeddings and perform k-nearest neighbor searches. It supports various languages through bindings like Python, Node.js, Deno, Ruby, Elixir, Go, and Rust, making it accessible to a wide range of developers.

table-transformer

60%

Table Transformer (TATR) is a deep learning model developed by Microsoft for extracting tables from unstructured documents, including PDFs and images. Based on object detection, TATR can be trained to work across various document domains, with pre-trained model weights available for the PubTables-1M dataset. The repository also provides the official code for the PubTables-1M dataset, a large-scale dataset for table detection, structure recognition, and functional analysis, and the GriTS evaluation metric for table structure recognition. Researchers and developers can use TATR to detect and recognize tables, convert them to HTML or CSV, and train custom models for specific needs.

talk2arxiv

60%

talk2arxiv is an open-source Retrieval-Augmented Generation (RAG) system specifically designed for academic paper PDFs. It enables users to chat with any ArXiv paper by simply modifying the paper's URL. The system features PDF parsing using GROBID for efficient text extraction, a custom chunking algorithm that organizes text by logical sections and recursive subdivision, and Cohere's EmbedV3 model for accurate text embeddings. It integrates with Qdrant for vector database storage and querying, which also caches research papers to avoid re-embedding. A reranking process ensures contextual relevance based on user input. The frontend is built with Typescript, ReactJS, TailwindCSS, and NextJS, while the backend utilizes Flask, Gunicorn, and Nginx.

EXPLORE OTHER CATEGORIES

🎨 Content & Design 📊 Productivity & Business 💻 Coding & Development 📚 Research & Education 🧘 Wellness & Lifestyle 💼 Career Development 📈 Marketing & Growth 📉 Data & Analytics 💬 Customer Support & CX 💰 Finance 🛒 E-commerce