AI Agents & Automation
Browsing page 139 of AI Frameworks & Infra in AI Agents & Automation. Sorted by confidence score — our independent quality rating.
vllora
vllora is a lightweight, open-source tool designed for real-time debugging of AI agents. It enables developers to trace, analyze, and optimize their AI agents instantly, providing live observability of calls, tool interactions, and agent workflows. The tool offers seamless integration with popular frameworks such as LangChain, Google ADK, and OpenAI, and supports all major AI providers through an OpenAI-compatible chat completions API. Key features include real-time tracing for monitoring agent interactions as they happen and full support for Model Context Protocol (MCP) servers, facilitating integration with external tools via HTTP and SSE. vllora is self-hostable and extensible, allowing users to add custom providers, tools, and MCP servers.
wasi-nn
wasi-nn is a proposed WebAssembly System Interface (WASI) API designed for performing machine learning (ML) inference within WebAssembly environments. It aims to simplify the integration of existing ML models, such as those from TensorFlow, ONNX, and OpenVINO, into WASI applications. The API focuses on ease of use by allowing users to load models as opaque byte sequences and achieve high performance through hardware acceleration like GPUs, TPUs, and FPGAs. While primarily focused on inference, the project acknowledges future potential for ML training. It is currently in Phase 2 of development, emphasizing a framework- and model-agnostic approach to support diverse ML frameworks and formats.
Cappius Technologies Inc.
Anblicks, formerly Cappius Technologies Inc., specializes in digital business transformation by leveraging data analytics and AI. They provide a comprehensive suite of services including strategy and architecture enablement, digital infrastructure preparation for AI, building governed enterprise data foundations, and scaling enterprise AI and decision intelligence. Anblicks also focuses on creating intelligent experiences and automation through AI-native applications and automated workflows. They offer industry-specific solutions across retail, healthcare, financial services, and real estate, aiming to drive measurable outcomes and accelerate time-to-value with domain-specific accelerators.
BentoML
BentoML offers a comprehensive inference platform designed for speed and control, enabling users to deploy any AI model anywhere. It provides tailored inference optimization, efficient scaling, and streamlined operations for various models, including LLMs and custom architectures. The platform simplifies inference infrastructure while offering full control over deployment, supporting open-source models and custom models alike. Key features include deployment automation, CI/CD, comprehensive observability, and fine-grained access control. BentoML also offers intelligent resource management with cross-region scaling, elastic auto-scaling, and cold-start acceleration, ensuring optimal compute utilization across various cloud environments or on-premises Kubernetes.
morphik-core
Morphik-core provides developers with an AI-native toolset for managing visually rich documents and multimodal data. It aims to simplify the integration of complex context into AI applications, addressing the limitations of traditional RAG approaches that often fail with diverse data types. The tool offers features like multimodal search, which understands visual content in documents, and fast, scalable metadata extraction including bounding boxes and classification. Morphik-core integrates with existing tools like Google Suite, Slack, and Confluence, and offers a free tier for developers to get started. It supports both cloud-based usage and self-hosting options, with Python SDK and REST API access.
OpenHGNN
OpenHGNN is an open-source toolkit designed for Heterogeneous Graph Neural Networks (HGNNs), built upon the Deep Graph Library (DGL) and PyTorch. It aims to facilitate research and development in heterogeneous graph-based machine learning by integrating state-of-the-art HGNN models. The toolkit offers easy-to-use interfaces for conducting experiments and supports various tasks including node classification, link prediction, and recommendation. Key features include extensibility for user-defined tasks, models, and datasets, efficiency through DGL's backend, and tools for hyperparameter optimization and visualization. It also supports mini-batch training and distributed training for large-scale graphs.
plock
Plock is an open-source, local-first development tool designed to streamline interaction with large language models (LLMs) and other scripts directly from any text input area. Users can write a prompt, select it, and trigger a command (e.g., Cmd+Shift+.) to replace the selected text with the streaming output of an LLM or custom script. It also supports using clipboard content as context for prompts. Plock is highly customizable via a `settings.json` file, allowing users to define shortcuts, integrate various models (like Ollama, GPT, Perplexity), and chain multiple actions. It emphasizes local execution by default but can be configured to use remote APIs through shell scripts. The tool is cross-platform, supporting Mac, Linux, and Windows (with some caveats).
Props AI
Props AI provides a comprehensive platform for developers and businesses to manage their AI applications effectively. It enables detailed usage tracking, offering insights into how AI models are being utilized. The platform also facilitates cost management, helping users optimize their spending on AI infrastructure. Furthermore, Props AI focuses on improving the overall user experience of AI applications by providing tools for monitoring performance and identifying areas for enhancement. It integrates with billing systems like Stripe for usage-based billing, making it suitable for monetizing AI services. The platform supports development in Python and JavaScript/TypeScript, catering to a broad range of AI projects.
clip-as-service
clip-as-service is an open-source tool designed for scalable embedding, reasoning, and ranking of images and text using the CLIP model. It can be easily integrated as a low-latency, high-scalability microservice into neural search solutions. Key features include fast serving of CLIP models with TensorRT, ONNX runtime, and PyTorch, offering up to 800QPS. It supports elastic scaling of multiple CLIP models on a single GPU with automatic load balancing. The tool provides an easy-to-use, minimalist API for both image and sentence embedding, supporting async clients and various protocols like gRPC, HTTP, and WebSocket. It also integrates smoothly with the Jina and DocArray neural search ecosystem, enabling the rapid building of cross-modal and multi-modal solutions.
super-gradients
Super-gradients is an open-source training library designed to simplify the process of building, training, and fine-tuning state-of-the-art computer vision models. It provides ready-to-deploy pre-trained models, including the high-performance YOLO-NAS and YOLO-NAS-POSE architectures, which outperform other YOLO versions in accuracy and speed. The library supports various computer vision tasks such as classification, semantic segmentation, object detection, and pose estimation. Users can easily load and fine-tune models with validated hyper-parameters, and all models are production-ready, compatible with deployment tools like TensorRT and OpenVINO. Super-gradients also offers advanced features like Quantization Aware Training (QAT) and Knowledge Distillation, along with support for Distributed Data Parallel (DDP) for efficient multi-GPU training.
deeppy
deeppy is an open-source deep learning framework designed for Python, leveraging NumPy for its core operations and offering CUDA acceleration to enhance computational performance. This makes it suitable for researchers and developers working on deep learning projects that require efficient processing. The framework aims to provide a Pythonic interface, allowing users to build and experiment with deep learning models using familiar Python constructs. Its foundation on NumPy ensures compatibility and ease of integration with the broader Python scientific computing ecosystem, while CUDA support addresses the need for high-speed parallel processing in deep learning tasks.
DQN-based-UAV-3D_path_planer
DQN-based-UAV-3D_path_planer, also known as RLGF, is a comprehensive open-source training framework designed for Unmanned Aerial Vehicle (UAV) deep reinforcement learning tasks. It provides a versatile environment for developing and testing UAV path planning solutions, supporting both continuous and discrete flight actions. The framework integrates a variety of mainstream deep reinforcement learning algorithms, including SAC, DQN, DDQN, PPO, Dueling DQN, and DDPG. Users can customize task environment models and UAV parameters via XML configuration files, enabling rapid development of specific UAV missions. It also offers training log support and multi-dimensional visualization of UAV trajectories, including static HTML outputs and dynamic visualization via a Java backend with MySQL integration.
ZenteiQ.ai
ZenteiQ.ai is an advanced AI platform designed to revolutionize engineering design by integrating physics-native AI with Scientific Foundation Models. It specializes in transforming complex simulation and experimental data into actionable intelligence, accelerating discovery, design, and industrial innovation across various sectors. The platform's capabilities are highlighted by its ability to handle intricate equations like the Heat Equation, Wave Equation, Navier-Stokes, and Schrödinger, indicating its application in highly technical and scientific domains. ZenteiQ.ai aims to provide intelligent surrogates for engineering design, enabling more efficient and accurate development processes.
ppq
PPL Quantization Tool (PPQ) is a powerful, open-source offline neural network quantization tool designed for industrial applications. It focuses on optimizing neural networks by converting floating-point operations to fixed-point operations, which significantly reduces computational costs and memory usage. This makes PPQ particularly suitable for deployment on edge devices where chip area and power consumption are limited. The tool offers a highly flexible and extensible framework, allowing users to customize quantization bit-width, granularity, and calibration algorithms for individual operators and tensors. PPQ's execution engine is specifically designed for quantization, supporting 99 common Onnx operator execution logics and native quantization simulation. It integrates with various inference frameworks like TensorRT, OpenVINO, and Onnxruntime, providing pre-built quantizers and export logic.
aios-core
aios-core is an open-source framework designed for AI-orchestrated full-stack development, empowering users to build AI-powered applications with greater control. It emphasizes a "CLI First" architectural premise, ensuring that all execution, decisions, and automation happen directly within the command-line interface. The framework introduces two key innovations: agentic planning, where specialized agents collaborate to create detailed PRD and architecture documents, and contextualized development, where a Scrum Master agent transforms these plans into hyper-detailed development stories for the `dev` agent. This approach aims to eliminate planning inconsistency and context loss, providing a comprehensive understanding for the development process. It supports various IDEs and CLIs, offering different levels of integration and automation.
Bender
Bender is an open-source abstraction layer built over MetalPerformanceShaders, designed to simplify the development and execution of neural networks on iOS devices. It addresses the growing need to run AI models directly on mobile devices, even if the training was done elsewhere. Bender provides an intuitive way to define and run neural networks using common layers like Convolution, Pooling, and FullyConnected. A key feature is its ability to load models trained in other frameworks, such as TensorFlow, by translating them into Bender layers. This eliminates the need to include TensorFlow's static library or rely solely on CPU execution, enabling GPU-accelerated inference on iOS. It aims to make on-device AI more accessible and performant for developers.
Chathero
Chathero is an AI agent platform designed for small and medium-sized businesses to automate customer service, marketing, and sales. It utilizes specialized AI agents that operate across various departments, ensuring consistent customer communication 24/7. The platform integrates a central AI knowledge base, fed from diverse sources like websites, shops, PDFs, and internal systems, eliminating the need for multiple training sessions and ensuring unified responses. Chathero supports communication across voice, webchat, and WhatsApp, all managed from a single AI logic. It's built for the mid-market, offering quick deployment, transparent pricing, and full GDPR compliance with EU hosting.
SchoolTool AI Helper by Acuity
SchoolTool AI Helper by Acuity is an AI learning companion designed to assist students with homework and revision across all academic levels. The app provides clear, detailed, and educational explanations for various subjects, from Math and Science to Literature and Philosophy. Users can receive step-by-step support to complete and improve their homework. For revision, Acuity generates study sheets, quizzes, and flashcards, helping students learn lessons more efficiently, track progress, and identify areas for improvement. It also features an AI coach named Mathilda, offering personalized assistance and pedagogically precise answers to academic questions. Acuity emphasizes privacy, requiring no account or personal data, and is available for instant use on mobile devices.
AI notes flashcard quiz maker
Flashcards AI is an innovative iOS application designed to help students convert their study notes into dynamic flashcards. Users can upload images or PDF documents containing their notes, and the AI instantly generates swipeable flashcards. This tool aims to enhance learning and memory retention by providing an efficient way to study. Key features include the ability to flip cards to reveal answers, organize flashcards by subject or topic, and a user-friendly interface. Flashcards AI is perfect for students looking to streamline their study process, prepare for exams more effectively, and improve their academic performance by turning overwhelming study materials into manageable, interactive learning aids.
RetNet
RetNet offers a minimal, pure PyTorch implementation of the Retentive Network, designed as a successor to the Transformer architecture for large language models. This repository focuses on aiding scientific and technological understanding and advancement, with code prioritizing correctness and readability over optimization. Key features include single-scale and multi-scale retention across parallel, recurrent, and chunkwise paradigms, as well as a multi-layer retentive network with FFN and LayerNorm. It also supports a causal language model built on top of the retentive network. The implementation utilizes Microsoft's xPos for positional encoding, with an alternative complex value encoding available for specific use cases, though it requires higher precision and memory.
rcnn
R-CNN (Regions with Convolutional Neural Network Features) is an open-source visual object detection system developed by Ross Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik at UC Berkeley EECS. It integrates bottom-up region proposals with rich features extracted by a convolutional neural network. At the time of its release, R-CNN achieved a 30% relative improvement in detection performance on PASCAL VOC 2012, reaching 53.3% mean average precision. While no longer maintained and considered a historical artifact, it serves as a foundational work for more recent and advanced object detection methods like Fast R-CNN and Faster R-CNN. The codebase is available on GitHub and requires MATLAB and Caffe for installation and use.
MonkAI
MonkAI is a Brazilian deep-tech company specializing in AI agents, offering a comprehensive platform for their monitoring and management. The company operates on three core pillars: Products, Infrastructure, and Observability, aiming to provide AI that actively works rather than just promises. MonkAI focuses on building infrastructure for production-grade AI agents, incorporating advanced architectures like GRKMemory and MonkAI Trace. This approach ensures the development of reliable, scalable, and cost-efficient AI systems tailored for enterprises across various sectors. Their platform is designed to support businesses in deploying and overseeing their AI agents effectively, ensuring optimal performance and operational efficiency.
xoul.ai
xoul.ai is an innovative entertainment and storytelling platform powered by AI, designed for users to create, explore, and share AI characters (Xouls) and scenarios. The platform emphasizes freedom of expression and unfettered creativity, aiming to provide intentional and meaningful experiences rather than an endless stream of content. Users can chat with AI characters, generate images and voice messages, and even create entire worlds. It offers various subscription plans with different energy allowances, cell allocations for premium features, and enhanced streak rewards. The platform supports community engagement through Discord and Reddit, and offers referral bonuses for new sign-ups.
Microsoft Azure
Microsoft Azure provides a comprehensive suite of cloud computing services for developers and IT professionals to create, deploy, and manage applications. It supports multi-cloud, on-premises, and edge deployments with scalable and cost-efficient solutions. Users can choose between a free account, offering $200 credit and free access to popular services for 30 days, or a pay-as-you-go model with free monthly amounts for over 20 popular services for 12 months and 65+ always-free services. Azure integrates seamlessly with the Microsoft ecosystem, offering advanced analytics, machine learning, and AI tools, including pre-built models and customizable options for intelligent feature integration. It's designed to help users go from idea to deployment quickly and efficiently.