AI Agents & Automation
KwaiAgents
KwaiAgents is an open-source project from KwaiKEG at Kuaishou Technology, offering a generalized information-seeking agent system built with Large Language Models (LLMs). The project includes KAgentSys-Lite, a simplified agent system with core functionalities, and KAgentLMs, a series of LLMs specifically tuned for agent capabilities such as planning, reflection, and tool-use. It also provides KAgentInstruct, a large dataset of agent-related instructions for fine-tuning, and KAgentBench, a comprehensive benchmark for evaluating agent performance across various dimensions. KwaiAgents supports both local and cloud-based LLM usage, making it a versatile platform for researchers and developers in the AI agent space.
LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive GitHub repository designed as an ultimate resource for developers, researchers, and enthusiasts looking to leverage Large Language Models (LLMs). It provides a curated guide with tutorials, best practices, and ready-to-use code for custom training and inferencing of LLMs. The resource covers foundational concepts in mathematics, Python, neural networks, and natural language processing, progressing to advanced topics like LLM architecture, instruction dataset building, pretraining, fine-tuning, RLHF, and evaluation. It also includes sections on model compression, inference optimization, open LLMs, and resources for cost analysis. LLM-PowerHouse aims to empower users to build intelligent applications and push the boundaries of natural language understanding.
GPT4All (Verified)
GPT4All is a free, open-source AI chatbot designed to run entirely on your local device, providing private and high-performance AI without requiring cloud connectivity. This means your data remains on your machine, ensuring privacy and security. It supports various operating systems including Windows, macOS, and Linux, and can run on CPU-only machines or systems with NVIDIA or AMD GPUs for faster inference. GPT4All is ideal for developers, teams, and AI power-users, offering full customization, local document chat (LocalDocs), and compatibility with thousands of open-source models. This empowers users to build assistants and workflows with maximum control, security, and speed, making it a robust solution for offline and privacy-sensitive AI applications.
loki-mode
loki-mode is an open-source, multi-agent autonomous system designed to transform product requirements into production-ready code. It operates by classifying complexity, assembling agent teams from 41 specialized types across 8 swarms, and running autonomous RARV (Reason - Act - Reflect - Verify) cycles with 9 quality gates. The system ensures code quality through blind 3-reviewer code review and anti-sycophancy checks, only delivering code that passes automated verification. It supports 5 AI providers (Claude, Codex, Gemini, Cline, Aider) with automatic failover and outputs full-stack solutions including source code, tests, Docker configs, CI/CD pipelines, and audit logs. The tool is self-hosted and private, ensuring data privacy and no vendor lock-in.
Ovanya
Ovanya is a company specializing in AI and Data Science, offering advanced solutions to help businesses achieve their organizational goals. Their team comprises AI experts, data scientists, software engineers, and business leaders who collaborate to develop innovative products. Ovanya's services encompass a range of AI applications, including computer vision, natural language processing (NLP), recommendation systems, data visualization, custom software development, and consultation. They aim to empower organizations to unlock maximum value from their data, providing expertise in areas like personalized greeting systems and product recommendation engines. The company is committed to developing solutions that drive growth and improve decision-making for their clients.
Convogenie AI
Convogenie AI offers an all-in-one platform for businesses to deploy and manage AI agents across various functions including sales, marketing, support, and operations. The tool is designed to automate lead capture, campaign management, and operational tasks, ensuring 24/7 autopilot functionality. It provides a unified workspace to manage conversations, knowledge, tasks, skills, and AI agents, catering to modern customer teams. Convogenie AI aims to transform customer engagement through intelligent AI conversations across multiple channels, enabling quick deployment of AI agents to drive meaningful interactions and streamline communication.
Platypus
Platypus offers code and resources for fine-tuning large language models (LLMs) from the Platypus family. Using LoRA (Low-Rank Adaptation) and PEFT (Parameter-Efficient Fine-Tuning) techniques, it enables efficient refinement of models based on the LLaMA and Llama 2 transformer architectures. The repository includes scripts for fine-tuning, merging LoRA weights, and performing inference. It also provides a data pipeline for refining datasets, including keyword search, duplicate removal, and similarity checks using SentenceTransformers embeddings. Platypus is designed to be multi-GPU friendly, supporting both model and data parallelism, and offers guidance on reproducing benchmark evaluation results.
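The similarity check in a data pipeline like this can be sketched in a few lines. The example below is an illustration only, not the Platypus code: the function names, the 0.8 threshold, and the toy 3-dimensional vectors (standing in for real SentenceTransformers embeddings) are all assumptions.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def drop_near_duplicates(items, embeddings, threshold=0.8):
    """Keep an item only if it is not too similar to any item already kept."""
    kept = []  # indices of retained items
    for i, emb in enumerate(embeddings):
        if all(cosine(emb, embeddings[j]) < threshold for j in kept):
            kept.append(i)
    return [items[i] for i in kept]

# Toy vectors standing in for real sentence embeddings (hypothetical values).
questions = ["What is 2+2?", "What's 2 + 2?", "Name the capital of France."]
vectors = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.0, 1.0, 0.0]]
unique_questions = drop_near_duplicates(questions, vectors)
```

Here the second question is dropped because its vector is nearly parallel to the first, while the third is kept. In practice the threshold would need tuning against the actual embedding model.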
CydexCode Technologies Pvt Ltd
CydexCode Technologies Pvt Ltd specializes in delivering cutting-edge AI-powered software solutions, machine learning, and software development services. They focus on revolutionizing the digital landscape by offering tailored solutions that include advanced predictive analytics, custom ML solutions, secure cloud infrastructure, and DevOps automation. The company also provides full-stack development, enterprise-grade solutions, and data-driven marketing strategies covering paid media, SEO, and conversion optimization. CydexCode aims to solve business problems and achieve outcomes through state-of-the-art AI, advanced software engineering, and transformative innovations.
RagaAI-Catalyst
RagaAI-Catalyst is a comprehensive Python SDK designed to enhance the management and optimization of LLM projects. It provides robust features for AI Agent observability, monitoring, and evaluation, including detailed tracing for agents, LLMs, and various tools. The platform facilitates debugging of complex multi-agent systems and offers a self-hosted dashboard with advanced analytics, featuring timeline and execution graph views. Key functionalities span project management, dataset management, evaluation management, trace management, prompt management, synthetic data generation, and guardrail management. This allows users to efficiently evaluate, optimize, and safeguard their LLM applications, ensuring transparency and performance in AI agent operations.
skald
Skald is an open-source platform designed to provide a production-ready Retrieval-Augmented Generation (RAG) system for your infrastructure. It offers a plug-and-play API that allows for quick deployment and extensive configuration of your RAG engine. Skald handles the entire ingestion process, including document parsing, chunking, summarization, tagging, embedding generation, and vector storage. For retrieval, it manages query rewriting, vector search, LLM chat, chat history, and source references. While it comes with solid defaults suitable for most use cases, users can fine-tune various aspects such as vector search parameters, reranking, models, and chunking strategies. Skald also includes built-in evaluation tools to assess the performance of custom RAG engines and supports powerful filtering for improved response accuracy and speed.
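To make the chunking step of a RAG ingestion pipeline concrete, here is a minimal sketch of fixed-size chunking with overlap. It is not Skald's implementation or API: the function name, the character-based windows, and the default sizes are assumptions chosen for illustration.

```python
def chunk_text(text, chunk_size=40, overlap=10):
    """Split text into fixed-size character windows that overlap."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks

document = "abcdefghij" * 10  # 100 characters of placeholder text
chunks = chunk_text(document)
```

The overlap means the end of one chunk is repeated at the start of the next, which helps retrieval when a relevant passage straddles a chunk boundary. Chunk size and overlap are the kind of parameters a configurable RAG engine would expose.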
self-adaptive-llms
self-adaptive-llms, also known as Transformer², is a self-adaptation framework designed to overcome the limitations of traditional, computationally intensive fine-tuning methods for Large Language Models (LLMs). This framework enables LLMs to adapt to unseen tasks in real time by selectively adjusting only the singular components of their weight matrices. During inference, Transformer² uses a two-pass mechanism: first, a dispatch system identifies the task properties, and then task-specific "expert" vectors, trained using reinforcement learning, are dynamically mixed to achieve targeted behavior for incoming prompts. This approach aims to improve the adaptability and efficiency of LLMs on diverse and novel tasks.
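The two-pass idea can be illustrated with a toy sketch. It assumes each expert vector rescales the singular values of a weight matrix. The keyword-based `dispatch` function, the expert values, and all names are hypothetical stand-ins, not the project's code.

```python
# Hypothetical expert vectors; in the framework these are learned with RL.
EXPERTS = {
    "math": [1.5, 1.0, 0.5],
    "code": [0.5, 1.0, 1.5],
}

def dispatch(prompt):
    """First pass (toy stand-in): guess task mixing weights from keywords."""
    if "solve" in prompt.lower():
        return {"math": 0.75, "code": 0.25}
    return {"math": 0.25, "code": 0.75}

def mix_expert_vectors(weights, experts):
    """Weighted sum of the expert vectors selected by the dispatcher."""
    dim = len(next(iter(experts.values())))
    mixed = [0.0] * dim
    for name, weight in weights.items():
        for i, value in enumerate(experts[name]):
            mixed[i] += weight * value
    return mixed

def adapt_singular_values(singular_values, z):
    """Second pass: rescale each singular value by the mixed vector."""
    return [s * z_i for s, z_i in zip(singular_values, z)]

weights = dispatch("Solve 12 * 7")
z = mix_expert_vectors(weights, EXPERTS)
adapted = adapt_singular_values([4.0, 2.0, 1.0], z)
```

Only the vector of singular values changes, so adapting to a task needs one small vector per weight matrix and leaves the matrix's singular vectors untouched.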
sentence-transformers
Sentence-transformers is a powerful open-source framework designed for generating state-of-the-art text embeddings. It simplifies the process of computing embeddings using Sentence Transformer models, calculating similarity scores with Cross-Encoder (reranker) models, and generating sparse embeddings via Sparse Encoder models. This framework unlocks a wide range of applications including semantic search, semantic textual similarity, and paraphrase mining. Users can leverage over 15,000 pre-trained models available on Hugging Face, or easily train and fine-tune their own custom embedding, reranker, or sparse encoder models. It supports various transformer networks like BERT, RoBERTa, and XLM-R, offers multilingual and multi-task learning, and includes over 20 loss functions for diverse NLP tasks.
simpletransformers
simpletransformers is an open-source Python library built upon HuggingFace's Transformers, designed to streamline the process of training and evaluating Transformer models. It significantly reduces the complexity, requiring only a few lines of code to initialize, train, and evaluate models for various Natural Language Processing (NLP) tasks. The library supports a wide array of applications including Information Retrieval (Dense Retrieval), Text Classification (binary, multi-class, multi-label), Token Classification (NER), Question Answering, Language Modelling, Language Generation, T5 Model Seq2Seq Tasks, Multi-Modal Classification, and Conversational AI. It offers task-specific models like ClassificationModel, ConvAIModel, and NERModel, each tailored with appropriate features and functionality. The library also integrates with Weights and Biases for experiment tracking and visualization, making it a powerful tool for developers and researchers working with Transformer models.
Cyphertech
AskCyph™ LITE by Cypher Tech Inc. is an innovative AI chatbot that redefines accessibility to AI by enabling in-browser AI model execution. This approach significantly enhances user privacy and security by processing AI models directly on the user's device. The tool aims to democratize AI, making advanced conversational AI capabilities available to a broader audience without compromising data integrity. It supports running various AI models locally, making it a versatile solution for individuals seeking private and secure AI interactions. Cyphertech focuses on empowering users with personal AI assistants that operate within their browser environment, ensuring that conversations and data remain confidential.
Cerebras
Cerebras is a platform for fast AI training and inference, powered by its Wafer-Scale Engine. It is positioned for deploying frontier models at production scale, with an emphasis on speed. The platform targets products that need instant answers, agents that do not stall, and conversations that flow, with use cases such as instant code debugging and multi-step workflow execution. Cerebras provides cloud, dedicated, and on-premise deployment options, supporting open models from the GLM, OpenAI, Qwen, and Llama families. It claims up to 15x faster inference than GPU clouds and offers OpenAI API compatibility for easy integration, making it suitable for AI-native leaders, startups, and large enterprises.
supermemory-mcp
Supermemory MCP offers a universal memory solution for Large Language Models (LLMs), ensuring that your conversational memories from platforms like ChatGPT are accessible across various LLM clients. This tool eliminates the need for multiple logins or paywalls, simplifying memory management for AI interactions. It boasts an extremely fast and scalable architecture built on the Supermemory API, and is completely free to use. Users can set it up with a single command, making it highly accessible. While the primary repository is maintained, the latest version is available via app.supermemory.ai, with self-hosting options requiring an API key from console.supermemory.ai.
Stable-Diffusion-NCNN
Stable-Diffusion-NCNN is a C++ implementation of the Stable Diffusion model, leveraging the NCNN deep learning inference framework. This open-source project supports both text-to-image (txt2img) and image-to-image (img2img) generation capabilities. It is designed for efficient deployment on various hardware, including x86 Windows, x86 Linux, macOS, and Android, with specific performance metrics provided for different resolutions and devices. The implementation includes dynamic shape resolution, support for positive and negative prompts, and utilizes the Euler ancestral sampler. Users can download pre-compiled executables and models or compile the project from source, making it accessible for developers and researchers looking to integrate Stable Diffusion into their applications.
Hey Buddy
Hey Buddy is a leading AI-native company specializing in creative tech services, including AI, AR/VR, Metaverse, Web3, and game development. They offer a wide range of services such as CGI development, custom software development for Android and iOS, VR development for training, and Web3 development leveraging blockchain technology. Their AI solutions include conversational AI, RAG system development, and agentic AI to redefine business workflows. Hey Buddy caters to businesses of all scales, from startups to enterprises, providing industry-specific expertise in real estate, BFSI, education, and manufacturing to drive digital transformation and market leadership.
ToolBench
ToolBench is an open-source platform designed to advance the capabilities of large language models (LLMs) in tool learning. It focuses on constructing large-scale, high-quality instruction tuning data, automatically generated using ChatGPT (gpt-3.5-turbo-16k) with enhanced function call capabilities. The platform includes a vast collection of 16,464 real-world REST APIs from RapidAPI, curated instructions for both single-tool and multi-tool scenarios, and a novel depth-first search based decision tree (DFSDT) for answer annotation. ToolBench also provides the corresponding training and evaluation scripts, along with a capable model called ToolLLaMA, fine-tuned on its dataset. It aims to enable open-source LLMs to master thousands of diverse real-world APIs, offering a comprehensive environment for research and development.
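The core of a depth-first search over a decision tree is backtracking: when one sequence of API calls reaches a dead end, the search returns to the last branching point and tries another call. The sketch below illustrates that idea only. It replaces the LLM that would propose candidate calls with a fixed toy dictionary, and every name in it is hypothetical.

```python
def dfs_decision_tree(state, expand, is_solution, max_depth=4):
    """Return the first action path that reaches a solution, or None."""
    if is_solution(state):
        return []
    if max_depth == 0:
        return None
    for action, next_state in expand(state):
        rest = dfs_decision_tree(next_state, expand, is_solution, max_depth - 1)
        if rest is not None:
            return [action] + rest
    return None  # every branch failed, so the caller backtracks

# Toy tree: each state maps to (API call, resulting state) candidates.
TOY_TREE = {
    "start": [("search_flights", "dead_end"), ("search_hotels", "hotels_found")],
    "dead_end": [],
    "hotels_found": [("book_hotel", "done")],
    "done": [],
}

path = dfs_decision_tree(
    "start",
    expand=lambda s: TOY_TREE[s],
    is_solution=lambda s: s == "done",
)
```

The search first tries `search_flights`, finds no way forward, backtracks, and then succeeds through `search_hotels` and `book_hotel`. A single linear chain of calls would have stopped at the first dead end.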
TinyTroupe
TinyTroupe is an experimental Python library developed by Microsoft for LLM-powered multiagent persona simulation. It enables users to create and simulate artificial agents, called TinyPersons, with customizable personalities, interests, and goals within simulated TinyWorld environments. Leveraging Large Language Models like GPT-4 and GPT-5, TinyTroupe generates realistic simulated behavior, focusing on understanding human behavior for productivity and business insights rather than direct AI assistance. Key applications include evaluating digital ads, providing test input for software, generating realistic synthetic data, offering feedback on product proposals from various personas, and simulating focus groups for brainstorming. The library is open-source and actively under development, with frequent updates to its API and features.
ultravox
Ultravox is a fast multimodal LLM designed for real-time voice interactions, developed by Fixie.ai. It understands both text and human speech directly, eliminating the need for a separate Automatic Speech Recognition (ASR) stage. This direct coupling lets Ultravox respond more quickly than systems that chain ASR and an LLM. The model builds on research from AudioLM, SeamlessM4T, Gazelle, and SpeechGPT, extending open-weight LLMs like Llama 3, Mistral, and Gemma with a multimodal projector. It currently takes audio input and emits streaming text, with future plans to emit speech tokens for direct audio conversion. Ultravox offers an 8B variant on Hugging Face and allows for training against any open-weight model, making it highly customizable for various use cases.
transformers-php
Transformers PHP is a toolkit that lets PHP developers integrate state-of-the-art machine learning into their applications. Designed to be functionally equivalent to Hugging Face's Python Transformers library, it offers access to thousands of pre-trained models across over 100 languages. The library supports a wide array of tasks including text generation, summarization, translation, sentiment analysis, and image classification. It runs models on ONNX Runtime for high-performance execution, so PyTorch or TensorFlow models need to be converted to ONNX first, which can be done with 🤗 Optimum. Installation is straightforward via Composer, with clear instructions for enabling the necessary PHP FFI extension. Transformers PHP also provides a pipeline API for ease of use, mirroring the Python library's approach, and offers configuration options for cache directories, remote hosts, and authentication tokens. A command-line tool is available for pre-downloading models to optimize performance.
Transformers.jl
Transformers.jl offers a Julia implementation of transformer-based models, built upon the Flux.jl deep learning library. This tool is designed for machine learning researchers and developers working within the Julia ecosystem, facilitating the implementation of Natural Language Processing (NLP) tasks. It provides functionalities for using pretrained models, such as BERT, and includes utilities for text encoding, tokenization, and processing. The library supports various transformer architectures, enabling users to experiment with and deploy advanced AI models directly in Julia. It is actively maintained with ongoing updates and community support through GitHub issues and Julia's Slack/Discourse channels.
NordAxon
NordAxon provides comprehensive AI consulting, custom machine learning solutions, and specialized training services from Malmö, Sweden. They assist organizations in navigating transformative technologies like Artificial Intelligence and Machine Learning, focusing on both proof-of-concepts and ambitious, disruptive ideas. Their end-to-end delivery covers everything from initial use case investigation to the deployment of AI solutions. Additionally, NordAxon offers AI education, including courses, seminars, and workshops, tailored for leaders and employees to build knowledge, experience, and confidence in AI. They also provide AI advisory services to analyze organizational AI/ML maturity and embed AI strategy across business units.