Coding & Development
Browsing page 336 of AI tools for Coding & Development. Sorted by confidence score — our independent quality rating.
ms-swift
ms-swift is a comprehensive, open-source framework developed by the ModelScope community, designed for fine-tuning and deploying large language models (LLMs) and multimodal large models (MLLMs). It supports over 600 text-only LLMs and 400 MLLMs, offering full-pipeline capabilities from training to inference, evaluation, quantization, and deployment. The framework integrates advanced training technologies, including Megatron parallelism (TP, PP, CP, EP) for acceleration and a rich family of GRPO reinforcement learning algorithms. ms-swift also supports various fine-tuning methods like LoRA, QLoRA, and DoRA, and provides memory optimization techniques such as Flash-Attention 2/3. It offers a Web-UI interface for simplified training, inference, evaluation, and quantization workflows, making it accessible for a wide range of users.
FireRedASR
FireRedASR is a family of open-source, industrial-grade automatic speech recognition (ASR) models developed by FireRedTeam. It provides robust support for Mandarin, various Chinese dialects, and English, setting new state-of-the-art benchmarks for Mandarin ASR. A key differentiator is its outstanding capability in recognizing singing lyrics. The tool offers two main variants: FireRedASR-LLM, designed for SOTA performance and seamless end-to-end speech interaction using an Encoder-Adapter-LLM framework, and FireRedASR-AED, which balances high performance with computational efficiency through an Attention-based Encoder-Decoder architecture. It also includes modules for VAD, LID, and Punc, making it a comprehensive ASR system.
Lightrun
Lightrun is an AI SRE platform designed to enhance production reliability by providing live runtime context for incident investigation and resolution. It enables developers, SREs, and AI agents to autonomously prevent and remediate software issues from code to production. The platform offers features like sandboxed instrumentation for logs, traces, metrics, and snapshots, allowing for deep code research and real-time end-to-end remediation. Lightrun helps reduce Mean Time To Resolution (MTTR) by triaging alerts, inspecting live execution, generating runtime evidence, and correlating it with code and infrastructure changes to prove root causes. It also facilitates autonomous remediation, offering fix recommendations and postmortems, and allows for validation of changes before release and testing on production traffic. The platform integrates with various tools and supports multiple programming languages and IDEs, ensuring security and compliance with standards like ISO 27001 and SOC 2 Type II.
gym-starcraft
gym-starcraft is an environment bundle designed for OpenAI Gym, specifically tailored to provide a StarCraft environment for artificial intelligence research. It leverages Facebook's TorchCraft, which acts as a bridge between the Torch deep learning library and the StarCraft game engine. This integration allows researchers and developers to create, train, and test AI agents within the complex and dynamic real-time strategy environment of StarCraft. The tool facilitates the development of advanced reinforcement learning algorithms by offering a standardized interface for interaction with the game. It is an open-source project hosted on GitHub, making it accessible for the AI research community to contribute and utilize.
generative-ai-cdk-constructs
The AWS Generative AI Constructs Library is an open-source extension of the AWS Cloud Development Kit (AWS CDK) offering multi-service, well-architected patterns for defining solutions in code. It enables developers to create predictable and repeatable infrastructure, known as constructs, for generative AI applications. The library provides high-level, multi-service abstractions of AWS CDK constructs with default configurations based on well-architected best practices. It is organized into logical modules using object-oriented techniques to model architectural patterns. The constructs are under active development and subject to non-backward compatible changes or removal in future versions, meaning users may need to update their source code when upgrading. It supports various languages including TypeScript, Python, C#, Go, and Java.
grenade
Grenade is an Open Source deep learning library implemented in Haskell, designed for developers to create sophisticated neural networks. It emphasizes composability and dependent typing, allowing for precise and concise specifications of complex network architectures, including recurrent neural networks. The library supports backpropagation and gradient application for network training, with layers represented as Haskell classes for easy customisation. Grenade facilitates the creation of networks as heterogeneous lists of layers, where types include both layers and data shapes. It also supports parallel layer execution and merging outputs, enabling the construction of series-parallel graphs and residual networks. The library is backed by hmatrix, BLAS, and LAPACK for performance, with critical functions optimized in C.
mlops-v2
The Azure MLOps (v2) solution accelerator offers enterprise-ready templates designed to streamline the deployment of machine learning models on the Azure Platform. This project serves as a foundational starting point for MLOps implementation within Azure, emphasizing repeatable, automated, and collaborative workflows. It empowers teams of ML professionals to efficiently get their machine learning models into production. The accelerator focuses on simplicity, modularity, repeatability, security, collaboration, and enterprise readiness, utilizing a template-based approach to enhance operational efficiency across the data science lifecycle. It supports both Azure DevOps and GitHub-based deployments, providing architectural patterns and quickstart guides for various project scenarios.
lit-llama
Lit-LLaMA offers an independent and open-source implementation of the LLaMA language model, building upon nanoGPT. It is designed to be simple, numerically correct, optimized for various hardware, and fully open-source under the Apache 2.0 license. The tool supports advanced features like flash attention, Int8 and GPTQ 4bit quantization for efficient memory usage, and LoRA and LLaMA-Adapter fine-tuning for adapting models to specific datasets. While this repository is no longer actively maintained, it serves as a foundational project, with its successor being the Lit-GPT project. It enables users to generate text, finetune models on custom data, and even venture into pre-training on large datasets like RedPajama.
Shimoku
Shimoku offers an analyst agent designed to assist users with data analysis and insight generation. While specific features are not detailed on the homepage, the tool's primary focus appears to be on leveraging AI to support analytical tasks. The platform aims to streamline the process of understanding complex data, potentially through automated reporting or intelligent data exploration. Its positioning as an "Analyst Agent" suggests a capability to act as a virtual assistant for data-driven decision-making, catering to individuals or teams who require efficient data interpretation.
neural-compressor
Intel Neural Compressor is an open-source Python library developed by Intel, offering advanced model compression techniques for deep learning frameworks like PyTorch, TensorFlow, and JAX. It supports a wide range of low-bit quantization methods, including INT8, FP8, MXFP8, INT4, MXFP4, and NVFP4, as well as sparsity. The library is designed to optimize the performance of Large Language Models (LLMs) and Vision-Language Models (VLMs) on Intel hardware such as Xeon Scalable Processors, Core Ultra Processors, and Gaudi AI Accelerators, with limited support for AMD and ARM CPUs, and NVIDIA GPUs. Key features include Static Quantization, Dynamic Quantization, SmoothQuant, Weight-Only Quantization, and Quantization-Aware Training, making it a comprehensive solution for deploying efficient AI models.
Barbara
Barbara is an Edge AI platform designed for industrial companies to deploy, run, and monitor Edge Applications and AI models directly on-site. It offers a simplified approach to managing industrial infrastructure compared to traditional cloud solutions. The platform provides container orchestration, industrial connectors for various assets, and ecosystem integration, allowing users to deploy Docker-based apps and integrate with existing development environments. For AI/ML developers, Barbara facilitates model deployment to Edge Nodes and offers an Apps Marketplace for off-the-shelf tools. Edge Infrastructure Managers benefit from effortless device lifecycle management, professional-grade network connectivity, and zero-touch provisioning for faster deployments. The platform emphasizes cybersecurity, IT/OT convergence, and MLOps capabilities to optimize and package trained models for efficient inference.
Bloop
Bloop is building tools designed to empower engineers to plan, orchestrate, and review the work of autonomous AI agents. The platform aims to provide the necessary infrastructure to significantly multiply output, especially as the industry transitions from simple auto-complete functionalities to more complex, long-running tasks. Its core mission is to transform every engineer into a high-velocity engineering manager by streamlining the management of AI agent workflows. This focus on agent orchestration and review positions Bloop as a critical tool for developers working with advanced AI systems, enabling more efficient and effective project execution.
OpenML
OpenML is a collaborative online machine learning platform designed to facilitate the sharing and organization of data, machine learning algorithms, and experimental results. It aims to create a frictionless, networked ecosystem where scientists and practitioners can easily integrate their existing processes and tools to collaborate globally. The platform provides significant benefits for science by enabling rapid building upon others' results, answering complex questions quickly through prior experiments, and making larger studies feasible. For scientists, it saves time on routine duties, compares new experiments to the state of the art, and offers potential for new discoveries and publications. OpenML also serves as a valuable learning environment for students and citizen scientists, allowing them to explore state-of-the-art methods and contribute their own work.
PyTorch-BayesianCNN
PyTorch-BayesianCNN provides an implementation of Bayesian Convolutional Neural Networks (CNNs) with variational inference, specifically utilizing Bayes by Backprop, within the PyTorch framework. This tool allows researchers and developers to build CNNs that can infer intractable posterior probability distributions over weights, offering a significant advantage over traditional frequentist approaches by providing uncertainty estimations. It includes two types of Bayesian layer implementations: BBB (Bayes by Backprop) and BBB_LRT (Bayes by Backprop with Local Reparametrization Trick), which enhances sampling efficiency. The repository supports standard datasets like MNIST, CIFAR10, and CIFAR100, and includes implementations of common models such as AlexNet and LeNet, making it a valuable resource for experimenting with Bayesian deep learning and understanding model uncertainty.
pytorch_active_learning
pytorch_active_learning is an open-source PyTorch library designed for active learning, accompanying the "Human-in-the-Loop Machine Learning" book. It offers a range of active learning methods, including Least Confidence, Margin of Confidence, Ratio of Confidence, and Entropy sampling. The library also supports more advanced techniques like Model-based Outlier sampling, Cluster-based sampling, and various forms of Active Transfer Learning. It is suitable for researchers and practitioners looking to experiment with and apply active learning strategies in computer vision and natural language processing, with a focus on real-world diversity to avoid bias. The code is stand-alone and can be easily integrated with existing PyTorch installations.
SimGNN
SimGNN is a PyTorch implementation of a novel neural network approach designed for fast graph similarity computation, as detailed in the WSDM 2019 paper. It addresses the computational burden of traditional methods like Graph Edit Distance (GED) and Maximum Common Subgraph (MCS) while maintaining high performance. The tool employs a learnable embedding function to map graphs into embedding vectors, providing a global summary. A key feature is its attention mechanism, which emphasizes important nodes for specific similarity metrics. Additionally, SimGNN includes a pairwise node comparison method to supplement graph-level embeddings with fine-grained node-level information. This approach leads to better generalization on unseen graphs and offers quadratic time complexity in the worst case. Experimental results demonstrate its effectiveness and efficiency, achieving smaller error rates and significant time reductions compared to existing baselines.
Self-Driving-Car-in-Video-Games
Self-Driving-Car-in-Video-Games is an open-source project featuring a supervised deep neural network designed to learn autonomous driving within video games, specifically Grand Theft Auto V. The model, named T.E.D.D. 1104, is trained using extensive human-labeled data, recording gameplay and key inputs to teach it how to navigate various vehicles under different weather conditions. It approaches the task as a classification problem, taking a sequence of five images as input and predicting the correct keyboard or Xbox controller inputs. The project provides pretrained models of varying sizes (XXL, M, S) and includes all necessary files for data generation, training, and real-time inference, primarily supporting Windows 10/11 for gameplay interaction.
tensorflow-federated
TensorFlow Federated (TFF) is an open-source framework designed for machine learning and other computations on decentralized data. It specifically supports Federated Learning (FL), an approach where a shared global model is trained across many participating clients while their sensitive training data remains local. This framework enables developers to utilize included federated learning algorithms with their existing TensorFlow models and data, or to experiment with novel algorithms. TFF provides both a high-level Federated Learning (FL) API for applying federated training and evaluation, and a lower-level Federated Core (FC) API for expressing new federated algorithms. It includes a single-machine simulation runtime for experiments, making it suitable for researchers and developers exploring privacy-preserving machine learning.
veles
Veles is a distributed platform designed for rapid deep learning application development, released under the Apache 2.0 license. It comprises several key components, including the core Veles platform, the Znicz Plugin which serves as a neural network engine, and Mastodon, a bridge facilitating integration between Veles and Java-based systems like Hadoop. Additionally, it features a SoundFeatureExtraction library for audio processing. This platform is ideal for developers and researchers looking to build and deploy deep learning applications in a distributed environment, offering tools for both model development and data processing.
transformer-xl-chinese
transformer-xl-chinese is an open-source project that leverages the Transformer-XL model for advanced Chinese text generation. This tool allows users to generate various forms of Chinese text, including novels, ancient poetry, and general conversational topics. Key functionalities include the ability to perform inference, visualize attention mechanisms within the model, and examine candidate words for generated text. The project builds upon existing Transformer-XL implementations, with specific modifications to support Chinese text generation and enhance usability through added inference capabilities and visualization tools. It provides scripts for data preparation, training, and inference, making it accessible for developers and researchers interested in exploring and applying Transformer-XL to Chinese language tasks.
TransmogrifAI
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an open-source AutoML library written in Scala, designed to run on Apache Spark. Developed by Salesforce, it focuses on enhancing machine learning developer productivity by automating various stages of the ML workflow, from feature engineering and validation to model selection. The library enforces compile-time type-safety, modularity, and reusability, enabling the creation of robust machine learning applications in a fraction of the time compared to traditional hand-tuned methods. It supports building models with minimal machine learning expertise, making advanced ML accessible to a broader range of developers. TransmogrifAI is particularly useful for structured data and offers flexibility for users who require more control over their ML pipelines.
Bert-Multi-Label-Text-Classification
Bert-Multi-Label-Text-Classification offers a PyTorch implementation of pretrained BERT and XLNET models specifically tailored for multi-label text classification. This open-source repository includes a structured codebase with modules for callbacks, configuration, dataset handling, model architecture, output management, text preprocessing, and training. Developers can fine-tune BERT models, preprocess data, and predict new data using provided scripts. The tool supports various dependencies like PyTorch, transformers, and scikit-learn, making it a robust solution for NLP tasks requiring multi-label classification.
TPVFormer
TPVFormer is an academic project offering a Tri-Perspective View (TPV) representation for vision-based 3D semantic occupancy prediction, serving as an alternative to Tesla's Occupancy Network for autonomous driving research. It addresses the limitations of traditional bird's-eye-view (BEV) representations by incorporating two additional perpendicular planes, allowing for a more fine-grained description of 3D scenes. The tool features a transformer-based TPV encoder (TPVFormer) to effectively obtain TPV features by aggregating image features. It demonstrates that camera inputs alone can achieve performance comparable to LiDAR-based methods on LiDAR segmentation tasks. The project also includes resources for semantic scene completion and comparisons with Tesla's Occupancy Network.
Tetris-deep-Q-learning-pytorch
Tetris-deep-Q-learning-pytorch is an open-source Python project that demonstrates the application of Deep Q-learning for training an AI agent to play the classic game Tetris. Developed with PyTorch, this tool serves as a foundational example of reinforcement learning in action. Users can leverage the provided source code to train their own Tetris-playing models from scratch or test pre-trained models. The project includes all necessary scripts for training and testing, making it accessible for those interested in understanding and experimenting with AI agents and deep learning techniques in a practical gaming context. It's an excellent resource for students and developers exploring the basics of reinforcement learning.