ShypdShypd.ai
💻

Coding & Development

Browsing page 152 of AI tools for Open Source & Models in Coding & Development. Sorted by confidence score — our independent quality rating.

Satellite-Imagery-Datasets-Containing-Ships

Satellite-Imagery-Datasets-Containing-Ships

58%

Satellite-Imagery-Datasets-Containing-Ships is a comprehensive GitHub repository that curates radar and optical satellite datasets specifically designed for ship detection, classification, semantic segmentation, and instance segmentation tasks. These datasets are invaluable for researchers and developers working in computer vision, machine learning, remote sensing, and maritime analysis. The repository details various datasets, including SSDD, OpenSARship, SAR-Ship-Dataset, AIR-SARShip, HRSID, LS-SSDD, and FUSAR-Ship, providing information on their authors, year, tasks supported, and direct access links. Each dataset entry includes specifics like image dimensions, spatial resolutions, polarization types, and annotation formats, making it a crucial resource for developing and evaluating algorithms for maritime surveillance and naval operations.

SpatialLM

SpatialLM

58%

SpatialLM is a 3D large language model designed to process 3D point cloud data and generate structured 3D scene understanding outputs. It can identify architectural elements such as walls, doors, and windows, as well as oriented object bounding boxes with their semantic categories. A key differentiator is its ability to handle point clouds from diverse sources, including monocular video sequences, RGBD images, and LiDAR sensors, unlike previous methods that often required specialized equipment. This multimodal architecture bridges the gap between unstructured 3D geometric data and structured 3D representations, providing high-level semantic understanding. SpatialLM enhances spatial reasoning capabilities for applications in embodied robotics, autonomous navigation, and other complex 3D scene analysis tasks. It offers models like SpatialLM1.1-Llama-1B and SpatialLM1.1-Qwen-0.5B, available on Hugging Face, and supports detection with user-specified categories.

rl

rl

58%

TorchRL is an open-source Reinforcement Learning (RL) library built for PyTorch, emphasizing a modular, primitive-first, and Python-first design. It provides a comprehensive framework for developing and deploying RL agents, featuring a command-line training interface for state-of-the-art agents without extensive coding. The library also includes a revamped vLLM integration for scalable LLM inference and training, offering features like AsyncVLLM service, multiple load balancing strategies, and distributed data loading. Additionally, TorchRL offers an experimental PPOTrainer for configurable PPO training solutions and a complete LLM API for fine-tuning language models, supporting RLHF, supervised fine-tuning, and tool-augmented training. Its design principles align with the PyTorch ecosystem, ensuring efficiency, extensibility, and minimal dependencies.

dreamtalk

dreamtalk

58%

DreamTalk is an open-source framework designed for generating expressive talking head videos. It utilizes diffusion probabilistic models to create high-quality videos that capture diverse speaking styles. The tool is robust, handling a wide array of inputs including songs, speech in multiple languages, and even noisy audio, and can work with out-of-domain portraits. Users can specify audio paths, style clips, head poses, and input images to generate videos. While the primary focus is on accurate lip-sync and vivid expressions, the resolution can be improved using external solutions like CodeFormer or MetaPortrait's Temporal Super-Resolution Model. The project provides inference code and pretrained checkpoints, though access to checkpoints requires an email request for academic research purposes.

streamlit-fastapi-model-serving

streamlit-fastapi-model-serving

58%

streamlit-fastapi-model-serving is an open-source project designed to simplify the deployment of machine learning models. It leverages FastAPI for creating a robust backend with automatic API documentation and Streamlit for building an interactive, user-friendly frontend. This combination allows developers to quickly serve PyTorch models, providing both a programmatic interface for other applications and a visual interface for direct user experimentation. The project uses Docker Compose to orchestrate these two services, ensuring seamless communication and easy setup. It's an ideal solution for developers looking to deploy ML models with a complete web application stack.

TextClassification-Keras

TextClassification-Keras

58%

TextClassification-Keras is a comprehensive code repository designed for implementing deep learning models for text classification tasks using the Keras framework. It offers ready-to-use implementations of popular models such as FastText, TextCNN, and TextRNN, making it a valuable resource for researchers and developers. The repository simplifies the application of these advanced models to text classification problems, supporting both English and Chinese documents. It serves as an excellent starting point for those looking to explore or integrate deep learning-based text classification into their projects, providing a foundational codebase for further development and experimentation.

techniques

techniques

58%

The 'techniques' GitHub repository serves as a comprehensive resource for deep learning methods specifically tailored for satellite and aerial imagery analysis. It provides an organized overview of various techniques designed to handle the unique challenges of processing large-scale image datasets. The repository focuses on methodologies for identifying diverse object classes within these images, making it a valuable asset for researchers and developers in the field. As an open-source project, it is freely accessible for both research and development purposes, fostering collaboration and advancement in the application of AI to geospatial data.

TFC-pretraining

TFC-pretraining

58%

TFC-pretraining is a specialized tool designed for self-supervised contrastive learning, specifically tailored for time series data. It leverages a novel approach called time-frequency consistency to significantly improve the learning process and the quality of representations derived from complex time series. The tool provides researchers and practitioners with not only the underlying methodology but also includes processed datasets and readily available code for implementing the technique. This makes it an invaluable resource for those working in time series analysis, enabling them to explore advanced predictive analytics and pattern recognition with greater efficiency and accuracy. Its focus on robust representation learning addresses key challenges in handling sequential data.

torchcv

torchcv

58%

TorchCV is a PyTorch-based framework designed for deep learning applications in computer vision. It offers a comprehensive collection of implementations for various models, primarily focusing on image classification and other common computer vision tasks. The framework is built with the goal of keeping pace with the latest advancements and research in the field, providing developers with up-to-date resources. While the provided content is a GitHub pricing page, the context indicates torchcv is a tool for developers working with computer vision models, likely open-source given its GitHub presence. It serves as a valuable resource for those looking to implement or experiment with state-of-the-art computer vision algorithms.

AnySplat

AnySplat

58%

AnySplat is an open-source tool designed for feed-forward 3D Gaussian Splatting from unconstrained views. It utilizes a transformer-based geometry encoder followed by three decoder heads to predict Gaussian parameters, depth maps, and camera poses. These outputs are then used to construct pixel-wise 3D Gaussians, which are voxelized and rendered into multi-view images and depth maps. The tool supports training and inference, with code available for installation and quick start. It also includes a Gradio-based demo for visualizing reconstructed 3D Gaussian Splats from uploaded images or videos, making it a valuable resource for researchers and developers in computer vision and graphics.

awesome-gpt4

awesome-gpt4

58%

awesome-gpt4 is an open-source GitHub repository offering a comprehensive, curated list of resources centered around the GPT-4 language model. It serves as a valuable hub for researchers, developers, and enthusiasts looking to delve deeper into GPT-4's applications and advancements. The repository categorizes resources into several key areas, including impactful scientific papers, a diverse collection of open-source projects leveraging GPT-4, community-contributed demos showcasing its capabilities, and various product integrations that utilize the model. Additionally, it features a section dedicated to GPT-4 news and announcements, keeping users updated on the latest developments. A significant part of awesome-gpt4 is its collection of impressive prompts, demonstrating effective ways to interact with GPT-4 for various tasks, from acting as a pharmacologist or lawyer to a debugger or mobile app developer. This makes it an indispensable resource for understanding, experimenting with, and developing applications based on GPT-4.

use-stick-to-bottom

use-stick-to-bottom

58%

use-stick-to-bottom is a lightweight, zero-dependency React Hook and Component specifically designed for AI chat applications. It automatically sticks to the bottom of a container and smoothly animates content to maintain its visual position as new messages are added. This tool does not rely on `overflow-anchor` CSS support, making it compatible with browsers like Safari. It uses the `ResizeObserver` API to detect content resizing, supporting both content growth and shrinking without losing stickiness. The hook also correctly handles scroll anchoring, preventing content jumps when elements above the viewport resize. Users can cancel stickiness by scrolling up, with clever logic distinguishing user scrolls from animation events. It features a custom smooth scrolling algorithm with velocity-based spring animations, ideal for streaming content with variable sizing common in AI chatbots.

AViD

AViD

58%

AViD is a streamlined toolkit designed for fine-tuning state-of-the-art vision-language detection models with parameter-efficient adaptation. Built upon the powerful Grounding DINO framework, AViD introduces capabilities for fine-tuning image-to-text grounding on custom datasets, which is crucial for applications requiring precise alignment between textual descriptions and image regions. Key features include a complete fine-tuning pipeline, parameter-efficient training with LoRA (allowing training of only ~2% of parameters), and EMA stabilization to retain pre-trained knowledge. It also offers optional phrase-based NMS for removing redundant boxes and includes a sample fashion dataset for immediate experimentation. The framework provides a comprehensive evaluation suite with metrics like mAP, Precision, Recall, and F1 Score, along with visualizations and detailed reporting.

brian2

brian2

58%

Brian2 is a free, open-source simulator for spiking neural networks, primarily written in Python. It provides a user-friendly and efficient platform for researchers to model and simulate complex neural circuits. The simulator is designed with ease of learning and use in mind, aiming to save scientists' time in addition to processing power. Brian2 is highly flexible and easily extensible, making it suitable for a wide range of neuroscience research applications. It is available on almost all platforms and offers comprehensive documentation. Users are encouraged to report issues via GitHub or the Brian forum and to cite the provided article if used for published research.

bullet3

bullet3

58%

bullet3 is the official C++ source code repository for the Bullet Physics SDK, offering real-time collision detection and multi-physics simulation capabilities. It is widely used across various domains including virtual reality, game development, visual effects, robotics, and machine learning. The SDK supports a range of platforms like Windows, Linux, Mac OSX, iOS, and Android, and includes experimental OpenCL GPGPU support for accelerating collision detection and rigid body dynamics. Users can also leverage PyBullet, Python bindings for enhanced support in robotics, reinforcement learning, and VR, with simple installation via pip. The project is licensed under the permissive zlib license.

wenet

wenet

58%

wenet is an open-source, production-first, end-to-end speech recognition toolkit designed to offer comprehensive solutions for automatic speech recognition (ASR). The project emphasizes production readiness and ease of use, making it suitable for developers and organizations looking to integrate robust speech recognition capabilities into their applications. It provides the foundational components necessary for building and deploying ASR systems, focusing on practical implementation rather than just research. The toolkit is hosted on GitHub, indicating a collaborative development model and accessibility for the developer community.

weight-loss

weight-loss

58%

weight-loss is an open-source GitHub repository that leverages machine learning to analyze personal weight and lifestyle data, helping users understand the factors contributing to weight changes. It provides scripts for data collection, conversion to a machine learning-friendly format (Vowpal Wabbit), and analysis to identify correlations between lifestyle choices (diet, sleep, exercise) and weight fluctuations. The project emphasizes a personal journey of experimentation and discovery, offering insights into effective weight loss strategies, particularly those related to ketosis and fasting. Users can apply the provided code to their own data to gain personalized insights, with a focus on identifying significant factors like sleep and carbohydrate intake.

deepframeworks

deepframeworks

58%

deepframeworks offers a comprehensive evaluation of popular deep learning toolkits, including Caffe, CNTK, TensorFlow, Theano, and Torch. This resource, though last updated in early 2016, provides detailed insights into each framework's modeling capability, interfaces, model deployment, performance, architecture, and ecosystem. It highlights strengths and weaknesses, such as Caffe's strong computer vision support versus poor recurrent network capabilities, or TensorFlow's clean architecture but lack of Windows support at the time. The evaluation also covers cross-platform compatibility and performance benchmarks, making it a valuable historical reference for understanding the evolution of deep learning frameworks.

VisualThinker-R1-Zero

VisualThinker-R1-Zero

58%

VisualThinker-R1-Zero is an open-source project that replicates DeepSeek-R1-Zero for visual reasoning tasks, specifically focusing on multimodal "aha moments." This tool demonstrates emergent reasoning capabilities and increased response length using a 2B non-SFT (non-Supervised Fine-Tuning) model. It allows researchers to explore how vision-centric tasks can benefit from improved reasoning, even observing self-reflection behavior during RL training on visual tasks. The project provides detailed instructions for setup, dataset preparation, and training using GRPO (Generalized Reinforcement Learning with Policy Optimization) for both multimodal aha moment reproduction and SFT model comparison. Evaluation scripts for CVBench are also included, making it a valuable resource for academic research in multimodal AI and visual understanding.

VLMEvalKit

VLMEvalKit

58%

VLMEvalKit is an open-source evaluation toolkit designed for large vision-language models (LVLMs), supporting over 220 LMMs and 80+ benchmarks. It simplifies the evaluation process by allowing one-command evaluation without extensive data preparation across multiple repositories. The toolkit uses generation-based evaluation for all LVLMs, offering results with both exact matching and LLM-based answer extraction. Recent updates include improved handling for models with thinking mode and long responses, as well as multi-node distributed inference support for faster evaluations. It aims to provide an easy-to-use, reproducible evaluation environment for researchers and developers.

RnPsoft

RnPsoft

58%

RnPsoft is a pioneering technology company dedicated to building tomorrow’s solutions today. They are at the forefront of the technology world, delivering top-tier software and applications that redefine how businesses and individuals operate. RnPsoft offers a comprehensive suite of services including MI/A.I solutions, app development, software development, blockchain solutions, and real-time solutions. Their team of expert developers and engineers are committed to turning client visions into reality, whether it's robust software to streamline business processes or intuitive applications to engage customers. They also provide educational services and focus on empowering visions through innovative and tailored solutions.

Human Feedback Foundation

Human Feedback Foundation

58%

The Human Feedback Foundation is a nonprofit member of the Linux Foundation AI & Data, dedicated to fostering more open and human-centered AI futures. They achieve this by building capacity across Canada's AI ecosystem, engaging over 500 nonprofits, 4,300 builders, and 200 students. Key initiatives include RAISE, Canada's first national program for nonprofit AI adoption, and AI in Production Mentorship, which closes the apprenticeship gap in AI education. The foundation also runs AI Tinkerers, Canada's largest AI/ML builders community, and participates in HUMAINE, an open AI evaluation at scale project. They publish insights and resources like the AI Adoption Playbook to guide responsible AI implementation.

Nyalazone Solutions Pvt. Ltd.

Nyalazone Solutions Pvt. Ltd.

58%

Nyalazone Solutions Pvt. Ltd. provides AI-enabled digital transformation solutions through its suite of enterprise-grade platforms. Their flagship products include Leggero.ai, Leggero Data Management & Analytics Platform (DMAP), Leggero Dynamic Data Source (DDS), and Leggero Digital Customer Engagement Platform (DCE). These platforms are designed to accelerate complex solution delivery, drive measurable impact, and modernize business processes. Nyalazone emphasizes rapid deployment, fit-to-purpose solutions, and cost efficiency, making their offerings scalable and adaptable for various organizational needs. They also provide advanced data integration, audit and risk compliance, complex data migration, Gen-AI enabled process automation, omnichannel customer engagement, and operations management using activity orchestration.

Epigram

Epigram

58%

Epigram, hosted on ExpiredDomains.com, is a platform dedicated to the sale of expired domain names, specifically highlighting epigram.news. It functions as a marketplace where users can purchase premium expired domains, often through GoDaddy. The platform offers extensive listings across numerous TLDs, updated daily, and provides unique data metrics to assess domain value. These metrics include SEO properties, keyword search volume, and historical data from WHOIS and Wayback Machine. It aims to assist SEOs, marketers, and investors in finding domains with existing authority, traffic, or strong branding potential, offering a user-friendly interface with quick filtering capabilities.