🤖

AI Agents & Automation

Browsing page 148 of AI Frameworks & Infra in AI Agents & Automation. Sorted by confidence score — our independent quality rating.

All AI Frameworks & Infra Browser & Web Agents Chatbots & Conversational AI General-Purpose Agents Multi-Agent Systems Personal Assistants RAG & Document AI RPA Scheduling & Task Agents Voice Agents Workflow Agents

Miniworld

58%

MiniWorld is a minimalistic 3D interior environment simulator specifically designed for reinforcement learning and robotics research. It allows users to simulate environments featuring rooms, doors, hallways, and various objects, making it suitable for tasks like training AI agents in office, home, or maze-like settings. Written 100% in Python, MiniWorld is easily modifiable and extensible, offering features such as few dependencies, good performance, lightweight design, and support for domain randomization for sim-to-real transfer. It also provides fully observable top-down views, depth map production, and the ability to display alphanumeric strings on walls. This project has been deprecated as of August 11, 2025, and is no longer receiving updates or support.

DeepCTR-Torch

58%

DeepCTR-Torch is a comprehensive, open-source Python package designed for building and experimenting with deep learning-based Click-Through Rate (CTR) models, leveraging the PyTorch framework. It offers a modular and extensible architecture, allowing users to easily implement and customize a wide range of CTR models, including popular architectures like DeepFM, xDeepFM, and Wide & Deep. The package includes numerous core component layers, enabling data scientists and researchers to construct their own custom models efficiently. With its user-friendly API, DeepCTR-Torch simplifies the process of training and predicting with complex models using standard `model.fit()` and `model.predict()` functions, making it an invaluable tool for recommendation systems and advertising applications.

dl-docker

58%

dl-docker offers an all-in-one Docker image designed for deep learning, simplifying the setup process by pre-packaging popular frameworks such as TensorFlow, Caffe, Theano, Keras, and Torch. It supports both CPU and GPU configurations, with the GPU version including CUDA 8.0 and cuDNN v5. The image also comes with essential libraries like iPython/Jupyter Notebook, Numpy, SciPy, Pandas, Scikit Learn, Matplotlib, and OpenCV. Users can either pull pre-built CPU images from Docker Hub or build both CPU and GPU versions locally. This solution addresses the 'dependency hell' often encountered when installing multiple deep learning frameworks, providing an isolated and fully functional OS environment for development.

DriveLM

58%

DriveLM is an open-source project focused on advancing autonomous driving research through Graph Visual Question Answering (GVQA). It provides comprehensive datasets, DriveLM-Data, built upon nuScenes and CARLA, specifically designed for driving with language. The project also offers DriveLM-Agent, a VLM-based baseline approach for jointly performing GVQA and end-to-end driving. DriveLM serves as a main track in the CVPR 2024 Autonomous Driving Challenge, offering a baseline, test data, submission format, and evaluation pipeline. It addresses the community's challenges by providing a benchmark for driving with language, exploring embodied applications of LLMs/VLMs, and investigating closed-loop planning with language.

Composo

58%

Composo is a quality layer for production AI, designed to identify and rectify silent AI failures before they impact customers. It connects to production traces to generate a detailed failure report, categorizing issues by type, severity, and frequency. The system learns from domain expert corrections, adapting to evolving quality standards and improving over time. Composo replaces lengthy internal evaluation infrastructure builds, deploying in 2-4 weeks compared to 3-6 months. It creates custom failure taxonomies for specific domains, leveraging insights from over 30 deployments across various industries. Confirmed failure patterns are converted into guardrails that block bad outputs at runtime with sub-second latency, ensuring quality enforcement on 100% of outputs.

DeepRobust

58%

DeepRobust is a comprehensive PyTorch adversarial library designed for both attack and defense methods across image and graph domains. It offers a robust toolkit for researchers and engineers to develop and evaluate the resilience of machine learning models against adversarial attacks. The library includes various algorithms for generating adversarial examples and implementing defense strategies, with continuous updates adding new attacks like UGBA for backdoor attacks on graphs and PRBCD for scalable graph attacks. DeepRobust also supports robust models like AirGNN and provides tools for converting datasets between PyTorch Geometric and DeepRobust, making it a versatile platform for adversarial machine learning research.

pytorch_diffusion

58%

pytorch_diffusion offers a PyTorch reimplementation of Denoising Diffusion Probabilistic Models, complete with checkpoints converted from the original TensorFlow implementation. This tool allows users to load diffusion models with pretrained weights for various datasets like CIFAR-10, LSUN-bedroom, LSUN-cat, and LSUN-church. It provides a quickstart guide for running a Streamlit demo, making it accessible for immediate use. Users can also instantiate and configure the U-Net model for denoising independently. The repository includes instructions for producing samples, evaluating results against TensorFlow models, and converting TensorFlow checkpoints to PyTorch, making it a comprehensive resource for researchers and developers working with diffusion models.

pysc2-examples

58%

pysc2-examples offers a collection of Deep Reinforcement Learning examples specifically designed for StarCraft II. Built upon Deepmind's pysc2, OpenAI's baselines, and Blizzard's s2client-proto, it provides a robust framework for developers and researchers. The project leverages TensorFlow 1.3 and includes examples for tasks like 'CollectMineralShards' using Deep Q Networks and A2C algorithms. Users can quickly set up the environment, install necessary libraries like pysc2 and baselines, download StarCraft II maps, and then train and enjoy their AI agents. It supports various parameters for training, including algorithm choice (deepq, a2c), total timesteps, exploration fraction, and options for prioritized replay or dueling networks.

SnakeFusion

58%

SnakeFusion is an AI project that leverages genetic algorithms and neural networks to train virtual snakes within a game environment. The core concept involves training five individual snakes, which can then be fused together to create a single, more advanced 'ultimate snake'. This project serves as a practical demonstration of applying evolutionary algorithms and AI in game development. Built using Processing, it provides a hands-on approach to understanding how AI can learn and adapt. Users can interact with the system by adjusting mutation rates, saving trained snakes, and initiating the fusion process to observe the creation of a 'super snake'.

Setup-NVIDIA-GPU-for-Deep-Learning

58%

Setup-NVIDIA-GPU-for-Deep-Learning is a comprehensive, open-source guide designed to assist users in setting up their NVIDIA GPUs for deep learning tasks. It outlines a clear, step-by-step process, starting with the installation of the latest NVIDIA GPU drivers. The guide then proceeds to cover essential software components such as Visual Studio with C++ support, Anaconda/Miniconda for package management, the CUDA Toolkit, and cuDNN. Finally, it provides instructions for installing PyTorch and includes a script to test the GPU setup, ensuring all components are correctly configured for optimal deep learning performance. This resource is invaluable for deep learning practitioners and AI researchers looking to streamline their development environment setup.

tf-gnn-samples

58%

tf-gnn-samples is a GitHub repository offering TensorFlow implementations of various Graph Neural Network (GNN) architectures. It serves as the code release for an article introducing GNNs with feature-wise linear modulation (GNN-FiLM). The repository includes implementations for Gated Graph Neural Networks (GGNN), Relational Graph Convolutional Networks (RGCN), Relational Graph Attention Networks (RGAT), Relational Graph Isomorphism Networks (RGIN), GNN-Edge-MLP, and Relational Graph Dynamic Convolution Networks (RGDCN). It provides scripts for training and evaluating models on tasks such as citation networks (Cora, Pubmed, Citeseer), protein-protein interaction (PPI), quantum chemistry prediction (QM9), and variable misuse detection (VarMisuse). The code allows users to reproduce experimental results presented in the accompanying research paper, making it a valuable resource for researchers and developers working with GNNs.

vector-python-sdk

58%

The Anki Vector Python SDK is an open-source toolkit that enables developers to program and control the Anki Vector robot using Python. It provides a comprehensive set of tools and documentation to facilitate the setup and integration of the Vector robot into various projects. The SDK is hosted on GitHub, indicating its community-driven nature and accessibility for contributions. It includes examples to help users get started and offers resources like an official SDK documentation and forums for support. This SDK is ideal for those looking to explore robotics, AI, and vision capabilities through the Anki Vector platform.

torchdrug

58%

TorchDrug is a robust, PyTorch-based machine learning platform specifically designed for drug discovery. It simplifies the implementation of graph operations in a PyTorchic style with GPU support, making it accessible even for practitioners with minimal drug discovery knowledge. The platform facilitates rapid prototyping of machine learning research by providing a wide range of common datasets and building blocks. Users can easily work with graph-structured data and molecules, extracting properties without deep domain expertise. TorchDrug also accelerates training and inference across multiple CPUs or GPUs, offering seamless scalability for complex experiments. It supports integration with Weights & Biases for experiment tracking and management.

tensorflow-triplet-loss

58%

Tensorflow-triplet-loss offers a robust implementation of triplet loss within the TensorFlow framework, specifically designed for metric learning tasks. It includes online triplet mining capabilities, which are crucial for training models that learn meaningful embeddings. The repository provides two main versions: "batch all" and "batch hard" triplet loss, allowing flexibility in how triplets are selected and processed. The code structure is adapted from CS230 assignments and is accompanied by tutorials, making it accessible for developers and researchers. It supports both CPU and GPU installations and includes scripts for training on datasets like MNIST, visualizing embeddings, and hyperparameter searching. This tool is ideal for those looking to implement or experiment with triplet loss for tasks such as face recognition or person re-identification.

ThreeDPoseUnityBarracuda

58%

ThreeDPoseUnityBarracuda is an open-source Unity sample project designed for 3D pose estimation, leveraging the Barracuda neural network inference library. This tool allows developers to implement real-time motion capture, enabling an avatar (like Unity-chan) to mimic human movements from a video input. It supports loading ONNX models for improved accuracy and provides options for choosing target videos, avatars, and even using a web camera for input. While the project is not actively maintained, it serves as a valuable foundation for integrating advanced pose estimation capabilities into Unity-based game development and other interactive applications. Users can customize avatar sizes and input sources, making it a flexible starting point for various motion-related projects.

Cenozic

58%

Cenozic is a premier digital IT services company offering comprehensive technology solutions to accelerate digital transformation for businesses. They provide expert custom software development, web and mobile app development, UI/UX design, and DevOps services across various industries, including healthcare, fintech, and retail. Cenozic focuses on delivering scalable, high-performance solutions using cutting-edge technologies like Node.js, Python, React, and AI frameworks such as TensorFlow and OpenAI. Their services encompass everything from concept and development to deployment and ongoing support, ensuring efficiency, innovation, and customer satisfaction. They are committed to helping businesses harness the power of technology to drive growth and stay ahead in the digital landscape.

X2Paddle

58%

X2Paddle is a deep learning model conversion tool developed under the PaddlePaddle ecosystem, designed to help users of other deep learning frameworks quickly migrate their models and projects to PaddlePaddle. It supports the conversion of prediction models from major frameworks like Caffe, TensorFlow, ONNX, and PyTorch. Additionally, X2Paddle facilitates the migration of entire PyTorch training projects, including both training and prediction code, to the PaddlePaddle framework. The tool offers detailed API comparison documentation to reduce the time and effort developers spend on migrating models. It boasts support for a wide range of models, covering over 130 PyTorch OPs, 90 ONNX OPs, 90 TensorFlow OPs, and 30 Caffe OPs, making it a comprehensive solution for model migration.

Score Jacobian Chaining

58%

Score Jacobian Chaining is a technique designed for analyzing the sensitivity of machine learning models. This tool is invaluable for AI researchers and machine learning engineers seeking to understand the intricate relationship between model inputs and outputs. By providing insights into how changes in input data propagate through a model, it facilitates effective debugging and optimization. This understanding is crucial for improving model performance, ensuring robustness, and gaining deeper insights into model behavior. While the current live website indicates a runtime error, the underlying concept is highly relevant for academic research and practical application in machine learning development.

nexent

58%

Nexent is a zero-code platform designed for auto-generating production-grade AI agents, leveraging Harness Engineering principles. It provides a unified approach to tools, skills, memory, and orchestration, incorporating built-in constraints, feedback loops, and control planes. The platform eliminates the need for complex orchestration or drag-and-drop interfaces, allowing users to develop any agent using pure language. Key features include smart agent prompt generation, scalable data processing for over 20 data formats, personal-grade and internet knowledge search with traceability, and multimodal understanding. It also boasts an MCP tool ecosystem for flexible integration of Python plug-ins, models, and chains without core code modification.

SmolVLM 256M Instruct WebGPU

58%

SmolVLM 256M Instruct WebGPU is an AI model developed by Hugging Face Smol Models Research, designed to provide instant visual descriptions. Users can upload a photo, and the application will generate a short text caption summarizing the image in clear, natural language. This tool operates entirely within a web browser, eliminating the need for any special setup or installations. It is particularly useful for quickly understanding the content of an image through an AI-generated description, making it accessible for a wide range of users who need immediate visual interpretation without complex configurations. The model is available as a Hugging Face Space, emphasizing its accessibility and ease of use.

BinaryNet.pytorch

58%

BinaryNet.pytorch offers a PyTorch implementation of Binarized Neural Networks (BNN), specifically designed for VGG and ResNet models. This open-source tool allows researchers and developers to delve into the world of binarized neural networks, which are known for their efficiency in terms of memory and computational resources. The project is hosted on GitHub and provides the necessary code to run models like resnet18 for datasets such as cifar10. It serves as a valuable resource for those looking to understand, implement, or experiment with BNNs within the PyTorch framework, building upon existing work in the field.

chatgpt-ai-template

58%

Horizon ChatGPT AI Template is an open-source ChatGPT UI AI Template and Starter Kit designed for developers using React, NextJS, and Chakra UI. This template provides a comprehensive foundation for building AI web applications, featuring over 30 dark/light frontend elements such as buttons, inputs, navbars, and cards. It aims to accelerate the development of Chat AI SaaS Apps by offering a pre-built, customizable user interface. The template includes detailed documentation and a quick-start guide for easy installation and setup. Users need an OpenAI API key with billing information to ensure full functionality. An example page is also provided for inspiration and rapid prototyping.

d2l-tvm

58%

d2l-tvm is an open-source project dedicated to deep learning compilers, offering comprehensive resources for those looking to understand and optimize deep learning models. Hosted on GitHub, it provides a platform for learning about the TVM deep learning compiler stack. The project includes detailed documentation, practical examples, and guides on how to contribute, making it a valuable resource for developers and researchers. It covers various aspects of deep learning compilation, from common operators and CPU/GPU schedules to deployment strategies, enabling users to dive deep into the technical intricacies of optimizing AI models.

Ray 3.0

58%

Ray 3.0 is a comprehensive debugging tool designed to streamline the development process by organizing all debug output in a dedicated desktop application. It eliminates the need for debug output to clutter your application or browser, providing a clean and interactive interface. Ray supports a wide range of languages and frameworks, including PHP, Laravel, JavaScript, Node.js, Vue.js, React, WordPress, and more, allowing developers to use the same debugging syntax across different environments. Key features include remote debugging over SSH, message archiving for later reference, and powerful tools to pause and measure code execution. The latest version, Ray 3.0, introduces enhanced AI integration, enabling users to interact with AI-generated HTML components, Mermaid, and ERD diagrams directly within the app, making it an invaluable tool for modern development workflows.

EXPLORE OTHER CATEGORIES

🎨 Content & Design 📊 Productivity & Business 💻 Coding & Development 📚 Research & Education 🧘 Wellness & Lifestyle 💼 Career Development 📈 Marketing & Growth 📉 Data & Analytics 💬 Customer Support & CX 💰 Finance 🛒 E-commerce