🤖

AI Agents & Automation

Browsing page 82 of AI Frameworks & Infra in AI Agents & Automation. Sorted by confidence score — our independent quality rating.

All AI Frameworks & Infra Browser & Web Agents Chatbots & Conversational AI General-Purpose Agents Multi-Agent Systems Personal Assistants RAG & Document AI RPA Scheduling & Task Agents Voice Agents Workflow Agents

Otter

61%

Otter is an open-source multi-modal model developed by EvolvingLMMs-Lab, built upon the OpenFlamingo architecture. It excels in instruction-following and in-context learning, trained extensively on the MIMIC-IT dataset, which comprises 2.8 million interleaved image-text/video instruction-response pairs. Otter supports various tasks, including scene comprehension, reasoning, and multi-round conversations, and can process both image and video inputs. The project also introduces OtterHD for fine-grained interpretations of high-resolution visual input and MagnifierBench for evaluating tiny object recognition. It provides training scripts, pre-trained weights, and supports integration with Hugging Face models.

OpenSandbox

61%

OpenSandbox is a robust, open-source sandbox platform designed for AI applications, offering a secure, fast, and extensible runtime environment for AI agents. It provides multi-language SDKs in Python, Java/Kotlin, JavaScript/TypeScript, C#/.NET, and Go, along with unified sandbox APIs. The platform supports both Docker and high-performance Kubernetes runtimes, enabling local execution and large-scale distributed scheduling. OpenSandbox is ideal for scenarios such as Coding Agents, GUI Agents, Agent Evaluation, AI Code Execution, and RL Training. It features strong isolation with secure container runtimes like gVisor and Firecracker microVM, and includes built-in Command, Filesystem, and Code Interpreter implementations.

opik

61%

Opik, built by Comet, is an open-source platform designed to streamline the entire lifecycle of LLM applications, from prototype to production. It empowers developers to evaluate, test, monitor, and optimize their models and agentic systems with comprehensive tracing of LLM calls, conversation logging, and agent activity. Key features include advanced evaluation capabilities like LLM-as-a-judge for tasks such as hallucination detection and RAG assessment, experiment management, and integration into CI/CD pipelines. Opik also offers production-ready scalable monitoring dashboards, online evaluation rules, and dedicated SDKs for prompt and agent optimization, along with guardrails for safe AI practices. It supports a wide array of frameworks and offers client SDKs for Python, TypeScript, and Ruby.

onepanel

61%

Onepanel is an open-source, end-to-end computer vision platform designed to streamline the entire computer vision lifecycle. It provides a unified environment for labeling datasets, building models, training, tuning hyperparameters, deploying, and automating computer vision workflows. The platform is built to be flexible, supporting deployment on any cloud infrastructure as well as on-premises environments. By integrating various open-source projects like Argo, Couler, CVAT, JupyterLab, and NNI, Onepanel offers a comprehensive solution for machine learning and deep learning practitioners. It aims to simplify complex computer vision tasks from data preparation to model deployment and automation.

PhiFlow

61%

PhiFlow is an open-source simulation toolkit designed for machine learning and optimization, primarily written in Python. It offers a differentiable PDE solving framework that seamlessly integrates with popular machine learning frameworks such as NumPy, PyTorch, Jax, and TensorFlow. This integration allows users to leverage automatic differentiation for building end-to-end differentiable functions that combine learning models with physics simulations. PhiFlow supports a wide range of applications, particularly in fluid dynamics, with features like built-in PDE operations, a flexible web interface for live visualizations, and object-oriented design for extensibility. It enables reusable simulation code across different backends and dimensionalities, making it a versatile tool for researchers and developers.

Brain4Industry

61%

Brain4Industry is a scientific-industrial consortium dedicated to enhancing the competitiveness of small and medium-sized manufacturing enterprises in the Czech Republic. It achieves this by facilitating the adoption of innovative digital solutions, additive manufacturing systems, and artificial intelligence. The consortium offers a comprehensive suite of services including digitalization and AI consulting, digital twin development, AI data management, and AI production assistance for R&D. Additionally, Brain4Industry provides expertise in AM research and product development, mechanical design, mathematical simulations, VR/AR applications, and prototyping. The platform also offers educational programs, financial consulting, and support for ESG reporting, acting as a one-stop shop for businesses seeking to integrate advanced technologies and improve sustainability.

pytorch-frame

61%

PyTorch Frame is a modular deep learning framework built upon PyTorch, specifically designed for heterogeneous tabular data. It supports various column types including numerical, categorical, text, time, and images, enabling the creation of sophisticated neural network models. The library provides a flexible architecture for implementing existing and future deep learning methods, featuring state-of-the-art models, user-friendly mini-batch loaders, and benchmark datasets. It also facilitates integration with diverse model architectures, including Large Language Models, allowing users to encode text data with embeddings and train alongside other complex semantic types. PyTorch Frame aims to democratize deep learning research for tabular data, making it accessible for both novices and experts.

redis-inference-optimization

61%

redis-inference-optimization is a Redis module designed for serving tensors and executing deep learning graphs. Previously known as RedisAI, this tool acts as a "workhorse" for model serving, offering support for popular Deep Learning and Machine Learning frameworks such as PyTorch, TensorFlow, TensorFlow Lite, and ONNXRuntime. It maximizes computation throughput and reduces latency by adhering to data locality principles, while simplifying the deployment and serving of graphs through Redis's robust infrastructure. Although the project is no longer actively maintained or supported, it provides a valuable reference for integrating AI inference capabilities directly within a Redis environment. Users are directed to the Redis website for current AI offerings.

rasa-demo

61%

Rasa-demo features Sara, a contextual AI assistant designed to demonstrate the capabilities of the open-source Rasa framework. Sara helps developers understand the Rasa framework, get started with its tools, and answers frequently asked questions. It can also direct technical questions to specific documentation, subscribe users to the Rasa newsletter, and request calls with the sales team. The repository provides all necessary files for installation and running the bot, including custom actions that can connect to external services like MailChimp and Google Sheets, requiring specific credentials for full functionality. This demo is ideal for those looking to explore and implement conversational AI solutions using Rasa.

61%

search is an open-source Go library designed for embedded vector search and semantic embeddings, utilizing llama.cpp. It offers an efficient solution for projects requiring semantic power without the complexities of traditional search systems. The library supports GGUF BERT models and provides GPU acceleration for quicker computations. It's particularly well-suited for datasets with fewer than 100,000 entries, offering features like llama.cpp integration without cgo, support for various BERT models in GGUF format, and precompiled binaries with Vulkan GPU support. Users can create and save search indexes from computed embeddings, enabling basic vector-based searches in Go applications.

SiT

61%

SiT (Scalable Interpolant Transformers) offers an official PyTorch implementation for exploring advanced generative models. Built on the foundation of Diffusion Transformers (DiT), SiT introduces an interpolant framework that allows for flexible connections between distributions, surpassing DiT's performance on the conditional ImageNet 256x256 benchmark with identical backbones and parameters. This repository includes pre-trained class-conditional SiT models, a training script utilizing PyTorch DDP, and sampling code with various configurable options for ODE and SDE samplers. Researchers and developers can leverage SiT to experiment with discrete vs. continuous time learning, different model predictions, interpolant choices, and deterministic or stochastic sampling strategies.

soprano

61%

Soprano is an ultra-lightweight, on-device text-to-speech (TTS) model designed for expressive, high-fidelity speech synthesis at unprecedented speed. It boasts features like up to 20x real-time generation on CPU and 2000x real-time on GPU, lossless streaming with low latency, and minimal memory usage with a compact 80M parameter architecture. Soprano supports infinite generation length with automatic text splitting and crystal clear audio generation at 32kHz. It offers widespread support for CUDA, CPU, and MPS devices on Windows, Linux, and Mac, and provides an OpenAI-compatible endpoint, ONNX, WebUI, CLI, and Python script for easy and production-ready inference.

RWKV-Runner

61%

RWKV-Runner is a comprehensive tool designed to eliminate barriers to using large language models by automating their management and startup. Weighing in at only 8MB, it provides a lightweight executable program that handles everything from model management and one-click startup to automatic dependency installation. A key feature is its compatibility with the OpenAI API, effectively turning any ChatGPT client into an RWKV client. It supports various configurations, including pre-set multi-level VRAM configs and WebGPU for broader graphics card compatibility (AMD, Intel). The tool also includes a user-friendly chat, completion, and composition interface, along with features like chat presets, attachment uploads, MIDI hardware input, and track editing. It offers built-in model conversion, download management, remote model inspection, and one-click LoRA Finetune (Windows Only). Additionally, it can function as a client for OpenAI ChatGPT, GPT-Playground, and Ollama, supporting multilingual localization and automatic updates.

Text Generation Inference (TGI)

61%

Text Generation Inference (TGI) is an open-source toolkit designed for deploying and serving Large Language Models (LLMs) with high performance. Developed by Hugging Face, it's used in production for services like Hugging Chat and the Inference API. TGI supports popular open-source LLMs including Llama, Falcon, and BLOOM, offering features such as tensor parallelism for faster inference on multiple GPUs, token streaming, and continuous batching for increased throughput. It also includes optimized transformers code with Flash Attention and Paged Attention, various quantization methods (bitsandbytes, GPT-Q, AWQ, Marlin, fp8), and hardware support for Nvidia, AMD, Inferentia, Intel GPU, Gaudi, and Google TPU. While TGI is now in maintenance mode, it has influenced the development of other optimized inference engines like vLLM and SGLang, which Hugging Face now recommends.

Factri.Ai

61%

Factri.Ai specializes in delivering AI-powered plug-and-play solutions tailored for manufacturing companies. Leveraging deep domain expertise in industrial engineering, digital transformation, and AI research, the platform builds practical and easy-to-implement solutions for complex manufacturing challenges. Factri.Ai aims to make advanced technology accessible, affordable, and scalable for factories, enabling them to benefit from digital transformation. Their solutions are designed for rapid deployment, often remotely, in a matter of days, ensuring quick integration and tangible results for industrial operations.

stable_diffusion.openvino

61%

stable_diffusion.openvino is an open-source implementation of text-to-image generation using Stable Diffusion, specifically designed for efficient performance on Intel CPUs or GPUs. This tool allows users to generate images from text descriptions, offering capabilities like text-to-image, image-to-image, and inpainting. It supports various parameters for fine-tuning image generation, including model selection, inference device, random seed, guidance scale, and initial image strength. The project provides clear instructions for installation on Linux, Windows, and MacOS, requiring Python <= 3.9.0 and OpenVINO™ Development Tools. Performance benchmarks are included, showcasing its efficiency across different Intel processors.

tribuo

61%

Tribuo is an open-source Java machine learning library developed by Oracle Labs' Machine Learning Research Group. It supports a wide range of prediction tasks including multi-class classification, regression, clustering, anomaly detection, and multi-label classification. The library provides its own implementations of various ML algorithms and also integrates with external tools like TensorFlow, ONNX Runtime, and XGBoost. A key feature is its use of the OLCUT configuration system, allowing repeatable model building from XML or JSON files. Tribuo emphasizes reproducibility with serializable provenance objects for models and evaluations, tracking data, transformations, and hyperparameters. It also supports exporting many models in ONNX format for deployment across different platforms.

vllm-omni

61%

vllm-omni is a framework designed for efficient model inference and serving of omni-modality models, building upon the foundation of vLLM. It expands support beyond text-based autoregressive generation to include text, image, video, and audio data processing. The framework also accommodates non-autoregressive architectures like Diffusion Transformers (DiT) and other parallel generation models, enabling heterogeneous outputs. Key features include state-of-the-art autoregressive support through efficient KV cache management, pipelined stage execution for high throughput, and fully disaggregated architecture with dynamic resource allocation. It offers flexibility with heterogeneous pipeline abstraction, seamless integration with Hugging Face models, and support for various parallelism techniques for distributed inference. vllm-omni also provides streaming outputs and an OpenAI-compatible API server.

workflow-builder-template

61%

Workflow-builder-template is an open-source template designed for developers to build their own visual AI workflow automation platforms. Built on top of Workflow DevKit, it offers a comprehensive drag-and-drop interface powered by React Flow, enabling users to design complex workflows with ease. The template includes real integrations with popular services like Resend (emails), Linear (tickets), Slack, PostgreSQL, and external APIs. A key feature is its code generation capability, converting visual workflows into executable TypeScript code with the "use workflow" directive. It also supports AI-powered workflow generation from natural language descriptions using OpenAI, secure user authentication with Better Auth, and detailed execution tracking with logs. The modern UI is built with shadcn/ui and Tailwind CSS, and it uses PostgreSQL with Drizzle ORM for type-safe database access.

MemOS

61%

MemOS is a Memory Operating System designed for Large Language Models (LLMs) and AI agents, unifying storage, retrieval, and management of long-term memory. It facilitates context-aware and personalized interactions by integrating knowledge bases, multi-modal data, and tool memory with enterprise-grade optimizations. Key features include a unified memory API structured as a graph, native support for text, images, and tool traces, and multi-cube knowledge base management for isolation and dynamic composition. MemOS also offers asynchronous ingestion via MemScheduler for production stability and allows memory refinement through natural-language feedback. It boasts significant accuracy improvements and token savings over OpenAI Memory, making it a robust solution for advanced AI agent development.

Qubic

61%

Qubic is a high-performance Layer 1 blockchain built on Useful Proof of Work (UPoW), designed to achieve true decentralized intelligence. It offers instant finality, feeless transactions, and the fastest smart contracts, with a verified peak TPS of 15.52M. Qubic is the first blockchain to integrate artificial neural networks at the protocol level, aiming to build Artificial General Intelligence (AGI) through its Aigarth project by 2027. The platform is fully open source and community-driven, with no pre-mine or venture capital funding. It addresses common blockchain challenges by eliminating transaction fees, repurposing mining energy for AI training, and providing genuine scalability without sacrificing decentralization.

xtream - Digital Products & AI Solutions

61%

xtream specializes in developing high-quality digital products and AI solutions for businesses. Their mission is to demonstrate that quality in design and execution always yields positive returns, contrasting with the time, money, and embarrassment often caused by poor implementations. They offer tailor-made services in both AI Solutions and Digital Products, built with expertise and knowledge to ensure they become valuable assets for their clients. The company emphasizes a hands-on approach, as highlighted by their featured case study with WeRoad, where they combined UX and AI to optimize tour planning, leading to faster and better decision-making. xtream is based in Milan, Italy, and serves a range of customers, from scale-ups to large corporations.

Hailo

61%

Hailo offers breakthrough AI processors specifically designed for high-performance deep learning applications on edge devices. Their product portfolio includes AI accelerators like the Hailo-8 and Hailo-10H, which are cost-efficient, low-power co-processors for real-time inference tasks. They also provide AI Vision Processors such as the Hailo-15L and Hailo-15H, which are AI-centric camera SoCs with high-performance AI processing for image enhancement and rich video analytics, including GenAI-powered smart search. Hailo's solutions are geared towards the new era of generative AI on the edge, alongside enabling perception and video enhancement across various applications like robotics, automotive, security, industrial automation, and retail. They also offer a comprehensive AI Software Suite to support their hardware.

KS Smart Solutions

61%

KS Smart Solutions, incorporated in 2016, offers bleeding-edge automated solutions tailored to client needs across diverse industries. They focus on digital transformation through Industry 4.0, AI, automation, and cloud technologies. Their expertise spans VR/AR solutions, Machine Learning/AI, web/mobile app development, and enterprise solutions. KS Smart Solutions aims to help clients achieve their digital objectives by developing customized IT solutions, enhancing agility, productivity, quality, and sustainability. They serve sectors like manufacturing, smart cities, defense, education, dairy, and entertainment, providing innovative solutions from immersive VR simulations for training to integrated inventory management systems.

EXPLORE OTHER CATEGORIES

🎨 Content & Design 📊 Productivity & Business 💻 Coding & Development 📚 Research & Education 🧘 Wellness & Lifestyle 💼 Career Development 📈 Marketing & Growth 📉 Data & Analytics 💬 Customer Support & CX 💰 Finance 🛒 E-commerce