ShypdShypd.ai
💻

Coding & Development

Browsing page 360 of AI tools for Coding & Development. Sorted by confidence score — our independent quality rating.

ml-workspace

ml-workspace

58%

ml-workspace is a comprehensive web-based Integrated Development Environment (IDE) designed specifically for machine learning and data science tasks. It offers a streamlined deployment process, allowing users to quickly set up and begin building ML solutions on their own machines. The workspace comes pre-loaded with a wide array of popular data science libraries such as Tensorflow, PyTorch, Keras, and Scikit-learn, alongside essential development tools like Jupyter, VS Code, and Tensorboard. These tools are perfectly configured, optimized, and integrated to provide a productive environment. Key features include web-based access to Jupyter, JupyterLab, and Visual Studio Code, a full Linux desktop GUI via web browser, seamless Git integration optimized for notebooks, and integrated hardware and training monitoring via Tensorboard and Netdata. It supports easy deployment on Mac, Linux, and Windows via Docker.

NVTabular

NVTabular

58%

NVTabular is a powerful feature engineering and preprocessing library specifically designed for tabular data, enabling the manipulation of terabyte-scale datasets. It accelerates computation on the GPU using the RAPIDS Dask-cuDF library, making it ideal for training deep learning-based recommender systems. As a core component of NVIDIA Merlin, it seamlessly integrates with other Merlin tools like Merlin Models, HugeCTR, and Merlin Systems to provide end-to-end acceleration for recommender systems on the GPU. NVTabular addresses challenges such as processing huge datasets, managing complex data pipelines, and overcoming input bottlenecks, allowing data scientists and ML engineers to focus on data transformation rather than scaling issues. It significantly reduces the time required for feature engineering and preprocessing, with reported completion times of 13 minutes on a single V100 GPU and 3 minutes on a DGX-1 cluster for the Criteo 1TB Click Logs Dataset.

365-Days-Computer-Vision-Learning-Linkedin-Post

365-Days-Computer-Vision-Learning-Linkedin-Post

58%

365-Days-Computer-Vision-Learning-Linkedin-Post is an open-source GitHub repository curated by Ashish Patel, offering a comprehensive, day-by-day learning journey through various computer vision concepts and models. Each entry in the repository corresponds to a LinkedIn post, providing a concise overview and a link to further resources on topics ranging from EfficientDet and YOLO Series to Vision Transformers, GANs, and advanced segmentation techniques. This resource is ideal for individuals looking to deepen their understanding of computer vision through a structured, accessible format, leveraging the power of community learning and readily available information.

3d-bat

3d-bat

58%

3D-BAT (3D Bounding Box Annotation Tool) is an open-source, web-based platform designed for annotating 3D bounding boxes on point cloud and image data. It offers a comprehensive suite of features for efficient and accurate data labeling, including AI-assisted labeling, batch-mode editing, and interpolation for sequences. The tool supports full-surround annotations, 3D to 2D label transfer, automatic tracking, and various viewing options like side views and perspective/orthographic editing. With capabilities for custom dataset, class, and attribute support, along with HD map integration and OpenLABEL compatibility, 3D-BAT is ideal for researchers and developers working with multi-sensor data in fields like autonomous driving and robotics. It also includes features like auto-save, redo/undo, and keyboard-only annotation for a streamlined workflow.

multimodal-agents-course

multimodal-agents-course

58%

multimodal-agents-course is a free, open-source educational program designed to teach developers how to build advanced AI agents. The course focuses on creating agents that can process and understand multimodal data, including images, text, audio, and videos. Participants will learn to build an MCP (Model Context Protocol) server for video processing using Pixeltable and FastMCP, design Groq-powered agents, and integrate systems with Opik for observability and prompt versioning. The curriculum emphasizes practical, hands-on implementation, covering topics like complex multimodal processing pipelines, video search engines, and LLMOps principles, making it suitable for ML/AI engineers, software engineers, and data engineers/scientists.

AICommit

AICommit

58%

AICommit is a powerful JetBrains plugin designed to streamline the commit message generation process for developers. Compatible with popular IDEs such as IntelliJ IDEA and WebStorm, it allows users to generate precise AI commit messages with a single click directly within their development workflow. The tool supports a range of AI providers including OpenAI, Azure OpenAI, Google Gemini, Anthropic Claude, and Ollama for local models, offering flexibility and privacy controls. Developers can utilize built-in templates or create custom ones for specific needs like Conventional Commits. AICommit emphasizes privacy by processing diffs locally before any API calls, ensuring no code is stored, logged, or shared, making it suitable for teams with strict data security requirements.

One-DM

One-DM

58%

One-DM, or One-Shot Diffusion Mimicker, is an open-source AI tool designed for stylized handwritten text generation. It stands out by requiring only a single reference sample as style input to imitate a user's writing style and generate new handwritten text with arbitrary content. This addresses a common challenge in previous methods that struggled with accurate style extraction from limited samples. One-DM enhances style extraction by incorporating high-frequency components from the reference sample, effectively capturing writing patterns while suppressing background noise. Extensive experiments across English, Chinese, and Japanese handwriting datasets demonstrate its superior performance, even outperforming methods that use significantly more reference samples. The project provides code, datasets, and pre-trained models for easy setup and use.

pytorch-grad-cam

pytorch-grad-cam

58%

pytorch-grad-cam is an advanced AI explainability package for computer vision, built on PyTorch. It offers a comprehensive collection of Pixel Attribution methods, including GradCAM, HiResCAM, ScoreCAM, and many others, to help diagnose model predictions and understand their decision-making process. The tool supports a wide range of architectures, from common CNNs to Vision Transformers, and can be applied to advanced use cases such as classification, object detection, semantic segmentation, and embedding-similarity. It includes smoothing methods like `aug_smooth` and `eigen_smooth` to produce clearer CAMs, and boasts high performance with full support for batches of images. Additionally, pytorch-grad-cam provides metrics for evaluating the trustworthiness and performance of explanations, making it valuable for both model development and research into new explainability methods.

python-utcp

python-utcp

58%

python-utcp is the official Python implementation of the Universal Tool Calling Protocol (UTCP), an open standard designed to allow AI agents to call any API directly, eliminating the need for additional middleware. It emphasizes scalability, extensibility, and interoperability, supporting a wide range of communication protocols through a modular, plugin-based architecture. Developers can easily integrate new protocols like HTTP, SSE, CLI, and more, or add custom tool storage and search strategies. The protocol is built on simple, well-defined Pydantic models, making it straightforward for developers to implement and use. This repository provides the core UTCP package, along with various protocol-specific plugins, and offers clear migration guides and usage examples for quick adoption.

pymarl

pymarl

58%

PyMARL is a Python-based, open-source framework developed by WhiRL for deep multi-agent reinforcement learning. It provides implementations of several prominent algorithms, including QMIX for monotonic value function factorisation, COMA for counterfactual multi-agent policy gradients, VDN for value-decomposition networks, IQL for independent Q-learning, and QTRAN for learning to factorize with transformation. The framework is built using PyTorch and integrates with SMAC (StarCraft Multi-Agent Challenge) as its environment, specifically using SC2.4.6.2.69232 for the results in the SMAC paper. PyMARL supports saving and loading trained models, as well as watching StarCraft II replays, making it a comprehensive tool for researchers and developers in the multi-agent RL domain.

PoseFormer

PoseFormer

58%

PoseFormer is an open-source project that provides an official implementation of the paper "3D Human Pose Estimation with Spatial and Temporal Transformers," accepted at ICCV 2021. This tool is designed for researchers and developers working in the field of computer vision and human pose estimation. It offers code built on VideoPose3D, allowing users to evaluate pre-trained models with both CPN detected and ground truth 2D poses as input. Additionally, PoseFormer supports training new models from scratch, with configurable frame inputs to achieve varying levels of accuracy. The repository also links to related works like Context-Aware PoseFormer (NeurIPS 2023) and PoseFormerV2 (CVPR 2023), indicating ongoing research and development in this area.

RQ-VAE-Recommender

RQ-VAE-Recommender

58%

RQ-VAE-Recommender offers a PyTorch implementation of a generative retrieval model, specifically designed for recommender systems. The model operates in two stages: first, it maps items in a corpus to a tuple of semantic IDs by training an RQ-VAE. Second, it tokenizes sequences of these semantic IDs using a frozen RQ-VAE and then trains a transformer-based model to predict the next IDs in the sequence. This approach is based on the research presented in "Recommender Systems with Generative Retrieval." It supports various datasets, including Amazon Reviews (Beauty, Sports, Toys), MovieLens 1M, and MovieLens 32M, and provides both RQ-VAE and decoder-only retrieval model training scripts. Pre-trained checkpoints are available on Hugging Face for Amazon Beauty.

pytriton

pytriton

58%

PyTriton is a Flask/FastAPI-like framework designed to streamline the use of NVIDIA's Triton Inference Server within Python environments. It allows developers to serve machine learning models with ease, supporting direct deployment from Python. Key features include native Python support for exposing any Python function as an HTTP/gRPC API, framework-agnostic operation compatible with PyTorch, TensorFlow, or JAX, and performance optimizations like dynamic batching, response caching, and model pipelining. The tool also provides decorators for handling batching and pre-processing, high-level model clients for HTTP/gRPC requests, and alpha support for streaming partial responses.

rosa

rosa

58%

ROSA (Robot Operating System Agent) is an AI Agent developed by NASA JPL, designed to facilitate interaction with ROS1- and ROS2-based robotics systems through natural language queries. Built on the Langchain framework, ROSA empowers robot developers to inspect, diagnose, understand, and operate robots more efficiently. It supports custom agent creation, allowing for adaptation to various robots and environments, and offers features like identifying topics with publishers but no subscribers. The tool includes a TurtleSim demo for controlling a simulated robot and is actively developing an IsaacSim extension for direct integration and control within the simulation environment.

Reproducible-Deep-Compressive-Sensing

Reproducible-Deep-Compressive-Sensing

58%

Reproducible-Deep-Compressive-Sensing is a comprehensive collection of source code dedicated to deep learning-based compressive sensing (DCS). This repository categorizes and provides access to numerous research works, offering links to their respective source code, PDF papers, and DOIs. The collection is organized based on key characteristics such as sampling matrix type (frame-based/block-based), sampling scale (single scale, multi-scale), and the deep learning platform used. It also includes code for image and video reconstruction, as well as other related applications. This resource is invaluable for researchers and developers looking to explore, reproduce, or build upon existing deep learning models in compressive sensing.

snorkel

snorkel

58%

Snorkel is an open-source system designed for the rapid generation of training data using weak supervision. Originating from Stanford in 2015, the project aimed to bring mathematical and systems structure to the often manual process of training data creation. It empowers users to programmatically label, build, and manage training data, addressing the critical role of data quality in machine learning project success. While the original Snorkel project is no longer actively developed, its core ideas and techniques have evolved into Snorkel Flow, an end-to-end AI application development platform. Snorkel is particularly useful for developers and data scientists looking to efficiently create large, labeled datasets for various machine learning tasks.

Chatterbox Labs

Chatterbox Labs

58%

Red Hat is a leading provider of enterprise open source solutions, offering a comprehensive suite of technologies for Linux, cloud, containerization, and Kubernetes. The platform supports hybrid cloud innovation with Red Hat Enterprise Linux and enables scalable application development with Red Hat OpenShift. For AI, Red Hat offers specialized products like Red Hat AI Enterprise, Red Hat AI Inference Server, and Red Hat OpenShift AI, designed to help businesses build, deploy, and monitor AI models and applications efficiently. The company emphasizes a community-powered approach, collaborating with open source communities to develop secure, stable, and innovative technologies, and provides extensive support, training, and consulting services.

Data Wizards

Data Wizards

58%

Data Wizards is an AI consulting firm specializing in helping corporates and ambitious SMEs unlock their business potential through expert AI solutions. They provide comprehensive services including AI strategy development, AI solution and development, and AI education. Data Wizards builds high-performing AI solutions to overcome challenges, streamline operations, and identify new growth opportunities. Their expertise spans various industries such as Automotive, Retail, Pharmaceutical, Manufacturing, Insurance, Financial, Logistics, Energy, Healthcare, Telecommunications, Media, SMEs, Security, Commodity, and Food, offering tailored applications like predictive maintenance, sales forecasts, customer churn analysis, and fraud detection.

swin2sr

swin2sr

58%

swin2sr is an open-source AI tool leveraging the SwinV2 Transformer for advanced image super-resolution and restoration. It excels at reducing JPEG compression artifacts and upscaling images, offering state-of-the-art performance in classical, lightweight, and real-world image super-resolution. The tool is particularly effective for compressed input scenarios, addressing common issues like training instability and resolution gaps in transformer vision models. It provides code, pre-trained models, and demos, making it suitable for both research and practical applications in image processing and low-level vision. Demos are available on platforms like Kaggle, Google Colab, and Huggingface Spaces.

Deep-RL-Notes

Deep-RL-Notes

58%

Deep-RL-Notes offers a comprehensive collection of notes on Deep Reinforcement Learning, specifically tailored for UC Berkeley's CS 285 (formerly CS 294-112) course, taught by Professor Sergey Levine. This resource serves as a textbook, covering foundational concepts like Markov decision processes and value functions, as well as advanced techniques such as Deep Q-Networks (DQN) and Proximal Policy Optimization (PPO). It integrates deep learning with reinforcement learning, discussing function approximation and representation learning. Users can compile the LaTeX source code into a PDF locally or edit it online via Overleaf, as the repository is regularly updated. The notes aim to balance theoretical clarity with practical relevance, providing examples, case studies, and programming exercises for hands-on experience.

sonata

sonata

58%

Sonata is the official project repository for "Sonata: Self-Supervised Learning of Reliable Point Representations," a CVPR'25 Highlight paper. This open-source tool provides self-supervised pre-trained Point Transformer V3 models specifically designed for various 3D point cloud downstream tasks. Users can leverage Sonata for quick inference and visualization, with easy-to-use installation options for both standalone and package modes. The repository includes pre-trained models, inference code, and visualization demos, making it accessible for researchers and developers. It supports custom data integration and offers a flexible data transformation pipeline, along with options for loading models from Huggingface or local paths, even accommodating environments without FlashAttention.

streaming

streaming

58%

Streaming is a data streaming library built by MosaicML designed to make training on large datasets from cloud storage as fast, cheap, and scalable as possible. It is specifically optimized for multi-node, distributed training for large models, ensuring correctness, performance, and ease of use. The library supports various data types including images, text, video, and multimodal data, and is compatible with major cloud storage providers like AWS, OCI, GCS, Azure, and any S3 compatible object store. It integrates seamlessly into existing training workflows as a drop-in replacement for PyTorch IterableDataset. Key features include seamless data mixing, true determinism for reproducible training runs, instant mid-epoch resumption, high throughput, and equal convergence compared to local disk solutions.

InsightNext

InsightNext

58%

InsightNext is a Google Cloud Partner specializing in AI/ML and Data Engineering. They offer deep expertise in Google Cloud Platform (GCP) and Google Workspace, helping organizations modernize their infrastructure and secure their workloads with robust governance. Their services focus on implementing AI/ML solutions and advanced data engineering practices to solve complex business challenges. InsightNext aims to drive enterprise data transformation through AI-driven cloud solutions and agentic AI systems, delivering measurable outcomes for their clients.

StableGen

StableGen

58%

StableGen is an open-source Blender addon that integrates generative AI into the 3D texturing workflow. It enables users to create fully textured 3D meshes from a single image or text prompt using TRELLIS.2, and then texture and refine them with models like SDXL, FLUX.1-dev, or Qwen Image Edit through a flexible ComfyUI backend. Key features include scene-wide multi-mesh texturing, multi-view consistency, advanced camera placement strategies, and precise geometric control with ControlNet. It also offers local editing, style guidance with IPAdapter, and integrated workflow tools like camera setup and texture baking, making it a comprehensive solution for 3D artists.