AI Agents & Automation
Browsing page 513 of AI Agents & Automation. Sorted by confidence score — our independent quality rating.
Applying_EANNs
Applying_EANNs is a 2D Unity simulation designed to showcase how cars can learn to navigate various courses. The cars are controlled by a feedforward neural network, whose weights are optimized using a modified genetic algorithm. This project provides a practical demonstration of evolutionary artificial neural networks in a simulated environment. Users can tinker with simulation parameters in the Unity Editor or run the built executable with default settings. The neural network architecture includes an input layer, two hidden layers, and an output layer, with its training managed by a customizable genetic algorithm. The user interface displays real-time data for the best performing car, including neural network output, evaluation value, and a generation counter, along with a visual representation of the neural network's weights.
legged_gym
legged_gym offers Isaac Gym environments specifically designed for training legged robots to walk on rough terrain. This open-source repository provides all necessary components for successful sim-to-real transfer, including an actuator network, friction and mass randomization, noisy observations, and random pushes during training. It supports the development and testing of robust robot control algorithms, with a focus on real-world applicability. The platform allows users to define and train new environments, customize robot assets, and fine-tune training parameters. While the project has migrated to Isaac Lab for future updates, this repository remains a valuable resource for those working with Isaac Gym Preview 3.
Instruction Synthesizer
Instruction Synthesizer is an AI tool hosted on Hugging Face Spaces, designed to generate structured instruction-response pairs from input text. Users can enter any text, and the application will process it to create organized instruction sets and their corresponding responses. This functionality is particularly useful for tasks requiring the extraction or creation of structured data from unstructured text, such as developing educational content, automating certain text-based processes, or preparing data for training other AI models. The tool aims to simplify the process of converting free-form text into a more actionable and organized format, making it accessible for various applications.
Awesome-Adaptation-of-Agentic-AI
Awesome-Adaptation-of-Agentic-AI is a curated repository featuring a comprehensive list of academic papers focused on the adaptation strategies of agentic AI systems. This resource is designed for researchers and practitioners interested in the evolving field of agentic AI, offering insights into various adaptation methods. The repository categorizes papers based on agent adaptation (tool execution signaled, agent output signaled) and tool adaptation (agent-agnostic, agent-supervised), detailing development timelines, methods, venues, tasks, tools, agent backbones, and tuning techniques. It serves as a valuable reference for understanding the latest advancements and research trends in making AI agents more adaptive and intelligent.
Augtech NextWealth IT Services Private Limited
Augtech NextWealth IT Services Private Limited is an ISO 9001:2015 certified organization providing Information Technology and Information Technology Enabled Services. They focus on delivering world-class "Data Enrichment" and "Customer Interaction" services to clients in AI/ML tech, E-commerce, Fin-Tech, Education, and other sectors. Their expertise includes data collection from diverse sources, data preparation involving cleansing, consolidation, normalization, and validation, and data enrichment for AI/ML models, including multimedia annotation. The company also offers customer service operations, including inbound and outbound support. Augtech NextWealth is a social impact organization committed to providing opportunities to talent in Tier-2 and Tier-3 ecosystems.
chess-alpha-zero
chess-alpha-zero is an open-source project dedicated to chess reinforcement learning, implementing methods inspired by DeepMind's AlphaGo Zero. It allows users to train AI models to play chess through self-play, supervised learning, and distributed training. The project provides a modular architecture with 'self' for data generation, 'opt' for model training, and 'eval' for model evaluation. It supports Python 3.6.3, TensorFlow-GPU, and Keras, making it suitable for developers and researchers interested in AI game development and machine learning applications in strategic games. The tool also offers a Universal Chess Interface (UCI) for integration with chess GUIs, allowing users to observe and interact with the trained AI.
ChatPal
ChatPal provides a unique approach to AI agents and automation by operating entirely within your browser. This means there's no need for user logins, API keys, or concerns about data privacy, as all processing occurs locally on your device. The tool is designed for users who prioritize data security and want to experiment with AI capabilities without external dependencies. By keeping everything client-side, ChatPal ensures that sensitive information remains private and under the user's control, making it an ideal solution for personal projects or secure environments where data sovereignty is paramount. It aims to make local AI accessible and easy to use for a broad audience.
Miniworld
MiniWorld is a minimalistic 3D interior environment simulator specifically designed for reinforcement learning and robotics research. It allows users to simulate environments featuring rooms, doors, hallways, and various objects, making it suitable for tasks like training AI agents in office, home, or maze-like settings. Written 100% in Python, MiniWorld is easily modifiable and extensible, offering features such as few dependencies, good performance, lightweight design, and support for domain randomization for sim-to-real transfer. It also provides fully observable top-down views, depth map production, and the ability to display alphanumeric strings on walls. This project has been deprecated as of August 11, 2025, and is no longer receiving updates or support.
Maya Demo
Maya Demo is an interactive AI tool hosted on Hugging Face that enables users to engage in conversations about uploaded images. Users can upload an image and then chat with the AI, which generates responses based on the visual content and the ongoing dialogue. The tool supports a wide range of languages including English, Spanish, Hindi, Chinese, Japanese, French, Russian, and Arabic, making it accessible to a global audience. It's designed for straightforward interaction, requiring users to upload an image before initiating a chat. The platform is currently in a 'sleeping' state due to inactivity, indicating it's a demonstration or experimental project.
Medgemma 27b Text It
Medgemma 27b Text It is an AI chatbot designed to provide medically-informed responses to user queries. Users can input a system prompt to guide the AI's persona or focus, followed by their specific medical question. The tool then generates a detailed response, making it ideal for individuals seeking expert advice on various health-related topics. While the live website currently indicates a runtime error, the intended functionality is to offer a conversational interface for medical information. It is hosted on Hugging Face, suggesting an accessible platform for its use.
DeepCTR-Torch
DeepCTR-Torch is a comprehensive, open-source Python package designed for building and experimenting with deep learning-based Click-Through Rate (CTR) models, leveraging the PyTorch framework. It offers a modular and extensible architecture, allowing users to easily implement and customize a wide range of CTR models, including popular architectures like DeepFM, xDeepFM, and Wide & Deep. The package includes numerous core component layers, enabling data scientists and researchers to construct their own custom models efficiently. With its user-friendly API, DeepCTR-Torch simplifies the process of training and predicting with complex models using standard `model.fit()` and `model.predict()` functions, making it an invaluable tool for recommendation systems and advertising applications.
dl-docker
dl-docker offers an all-in-one Docker image designed for deep learning, simplifying the setup process by pre-packaging popular frameworks such as TensorFlow, Caffe, Theano, Keras, and Torch. It supports both CPU and GPU configurations, with the GPU version including CUDA 8.0 and cuDNN v5. The image also comes with essential libraries like iPython/Jupyter Notebook, Numpy, SciPy, Pandas, Scikit Learn, Matplotlib, and OpenCV. Users can either pull pre-built CPU images from Docker Hub or build both CPU and GPU versions locally. This solution addresses the 'dependency hell' often encountered when installing multiple deep learning frameworks, providing an isolated and fully functional OS environment for development.
DriveLM
DriveLM is an open-source project focused on advancing autonomous driving research through Graph Visual Question Answering (GVQA). It provides comprehensive datasets, DriveLM-Data, built upon nuScenes and CARLA, specifically designed for driving with language. The project also offers DriveLM-Agent, a VLM-based baseline approach for jointly performing GVQA and end-to-end driving. DriveLM serves as a main track in the CVPR 2024 Autonomous Driving Challenge, offering a baseline, test data, submission format, and evaluation pipeline. It addresses the community's challenges by providing a benchmark for driving with language, exploring embodied applications of LLMs/VLMs, and investigating closed-loop planning with language.
gans-in-action
gans-in-action is the official companion repository for the book 'GANs in Action: Deep Learning with Generative Adversarial Networks' by Jakub Langr and Vladimir Bok. This open-source resource allows users to reproduce, study, and extend every hands-on example from the book. It features Jupyter notebooks that walk through major variants in the GAN family, from vanilla GANs to CycleGANs, implemented using Keras/TensorFlow. The repository covers fundamental concepts of generative modeling, adversarial training, and best practices for stable GAN training. It includes implementations of architectures like DCGAN, Progressive GAN, Semi-Supervised GAN, and Conditional GAN, along with educational resources and canonical GAN papers.
everything-chatgpt
everything-chatgpt is an open-source project hosted on GitHub that delves into the technical underpinnings of the ChatGPT web application. It offers a detailed exploration of various components, including backend API calls, data structures like session and user data, and model information. The project also covers beta features such as custom instructions, code interpreter, and plugins, along with how chat history and data export functionalities work. It's a valuable resource for developers, researchers, and anyone interested in understanding the technical architecture and operational mechanics of ChatGPT.
n8n-docs
n8n-docs serves as the official documentation repository for n8n, a fair-code licensed automation tool. It offers comprehensive resources for both the free community edition and powerful enterprise options, guiding users on how to effectively connect various applications and build automated workflows. The documentation specifically highlights how to integrate and build AI functionality into these workflows, making it a valuable resource for developers and technical users looking to leverage n8n's capabilities. It includes detailed guides on setting up local previews, troubleshooting common issues, and contributing to the documentation itself, ensuring a smooth experience for both new and experienced users.
MyIP
MyIP is a comprehensive, open-source IP Toolbox designed for detailed network analysis and diagnostics. It enables users to easily view their local and public IP addresses, perform IP geolocation lookups, and conduct essential network tests such as DNS leak detection and WebRTC connection examination. The tool also includes speed tests, ping tests, and MTR tests to assess network performance and connectivity. Additionally, MyIP offers website availability checks, WHOIS searches for domain and IP information, MAC lookups, and browser fingerprint analysis. It supports multiple languages, dark mode, a minimalist mobile-optimized mode, and PWA installation, making it a versatile solution for network professionals and users concerned with their online privacy and connectivity.
dynamax
Dynamax is a Python package designed for probabilistic state space modeling, leveraging the JAX library for efficient computation. It offers robust capabilities for both inference (state estimation) and learning (parameter estimation) across a range of state space models. These include Hidden Markov Models (HMMs), Linear Gaussian State Space Models (Linear Dynamical Systems), Nonlinear Gaussian State Space Models, and Generalized Gaussian State Space Models with non-Gaussian emission models. The library provides both low-level, functionally pure inference algorithms and a user-friendly, object-oriented interface through model classes. It is compatible with other JAX ecosystem libraries like Optax for stochastic gradient descent and Blackjax for Hamiltonian Monte Carlo or sequential Monte Carlo.
eyeballer
Eyeballer is a convolutional neural network designed by Bishop Fox for analyzing penetration testing screenshots. It helps security professionals identify "interesting" targets from a vast collection of web-based hosts, particularly useful in large-scope network penetration tests. Users can employ their favorite screenshotting tools like EyeWitness or GoWitness, then process the outputs through Eyeballer to categorize them. The tool labels screenshots into categories such as "Old-Looking Sites" (indicating potential vulnerabilities), "Login Pages" (suggesting further functionality and credential enumeration opportunities), "Webapp" (signifying a larger attack surface), "Custom 404's" (to filter out uninteresting pages), and "Parked Domains" (to remove invalid attack surfaces from scope). Eyeballer provides results in both human-readable HTML and machine-readable CSV formats, offering performance metrics like Overall Binary Accuracy and All-or-Nothing Accuracy.
interpret
InterpretML is an open-source Python package designed to bring clarity to machine learning models. It provides a unified framework for state-of-the-art interpretability techniques, enabling users to both train inherently interpretable 'glassbox' models like the Explainable Boosting Machine (EBM) and explain complex 'blackbox' systems using methods like SHAP and LIME. This tool is crucial for tasks such as model debugging, feature engineering, detecting fairness issues, and ensuring regulatory compliance in high-risk applications. It supports various data types natively and offers functionalities for global model understanding as well as explanations for individual predictions, making it a comprehensive solution for data scientists and machine learning engineers.
Introduction_to_Machine_Learning
Introduction_to_Machine_Learning is a comprehensive GitHub repository offering educational materials for the Machine Learning course at Sharif University of Technology. This resource provides students and machine learning enthusiasts with access to detailed slides, interactive Jupyter notebooks for practical application, and various exercises to reinforce learning. Users can also find materials from previous semesters, ensuring a rich and evolving learning experience. Additional resources, including class videos, are available on the SharifML website (in Persian). The content is freely usable, with a request for proper citation of both the course and the GitHub repository under a Creative Commons BY license.
Gymnasium
Gymnasium is an open-source Python library designed for developing and comparing reinforcement learning algorithms. It offers a standardized API for communication between learning algorithms and environments, alongside a comprehensive set of API-compliant environments. This library is a fork of OpenAI's Gym, maintained by the original team, ensuring continued development and support. It includes diverse environment families such as Classic Control, Box2D, Toy Text, MuJoCo, and Atari, catering to various complexity levels and problem types. Gymnasium also supports third-party environments and provides strict versioning for reproducibility. It is an essential tool for researchers and developers in the reinforcement learning field.
h1st
h1st offers power tools for AI engineers, championing a Human-First AI approach to address critical challenges in real-world data science. It helps overcome data scarcity in industrial AI by combining human knowledge with available data, enabling earlier time-to-market for intelligent systems. The platform fosters collaboration among data scientists by breaking down complex modeling problems into smaller, manageable parts, similar to established software engineering methodologies. Furthermore, h1st supports transparent and trustworthy AI by providing model description and explanation at multiple layers, which is crucial for deployment and regulatory compliance. It runs on Python 3.8 or above and can be easily installed via pip.
HybrIK
HybrIK is an open-source project offering a hybrid analytical-neural inverse kinematics (IK) solution for 3D human pose and shape estimation. It provides the official code for the research papers "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation" (CVPR 2021) and "HybrIK-X: Hybrid Analytical-Neural Inverse Kinematics for Whole-body Mesh Recovery" (TPAMI 2025). The tool allows users to convert accurate 3D keypoints into parametric body meshes. Key features include demo code for visualizing HybrIK on videos and images, support for both SMPL and SMPL-X models, and a Blender add-on for importing results as FBX files. It also supports multi-person demos with pose-tracking and provides pretrained models with various backbones.