AI Agents & Automation
Browsing page 466 of AI Agents & Automation. Sorted by confidence score — our independent quality rating.
torchscale
torchscale is a PyTorch library specifically engineered to facilitate the scaling of Transformer models, which are fundamental to modern large language models. It emphasizes key aspects such as modeling generality and capability, ensuring that the models can be applied across a wide range of tasks and perform robustly. The library also prioritizes training stability and efficiency, crucial for developing and managing large-scale foundation models. By providing tools and frameworks within the PyTorch ecosystem, torchscale aims to empower researchers and developers to build, train, and deploy increasingly complex and powerful AI models more effectively.
KEF Robotics
KEF Robotics specializes in state-of-the-art autonomy software designed to enable aircraft and Unmanned Aircraft Systems (UAS) to fly without human intervention. At its core, the company utilizes advanced computer vision algorithms that process camera data to facilitate autonomous flight across a diverse range of platforms and use cases. The primary mission of KEF Robotics is to enhance the safety and reliability of aircraft operations, while simultaneously expanding their operational range and utility. Founded in Pittsburgh in 2018, KEF Robotics aims to integrate its technology with existing UAS systems, offering a solution for advanced aerial autonomy.
AI Browser - Safety & Fast
AI Browser is an Android mobile application designed to offer users a secure and private web browsing experience. It prioritizes user privacy by including an incognito mode, ensuring that no browsing history or traces are left behind after a session. Beyond privacy, the app enhances user productivity by providing a personalized, real-time news feed, keeping users updated with relevant information. Additionally, it features a built-in capability to download videos directly within the app, streamlining content consumption on the go. This combination of security, personalization, and utility makes AI Browser a comprehensive solution for mobile web users.
Uni-ControlNet
Uni-ControlNet is an advanced AI tool designed to offer comprehensive control over text-to-image diffusion models. It provides an all-in-one method for controllable image synthesis, allowing users to precisely guide the generation process. The tool unifies various control aspects, simplifying the creation of specific image outputs. Based on research presented at NeurIPS 2023, Uni-ControlNet aims to enhance the flexibility and accuracy of AI-driven image generation, making it a valuable resource for researchers and developers working with diffusion models.
UER-py
UER-py (Universal Encoder Representations) is an open-source framework designed for pre-training on general-domain corpora and fine-tuning on downstream NLP tasks using PyTorch. It emphasizes model modularity, allowing users to combine various embedding, encoder, decoder, and target modules to construct custom pre-training models. The toolkit supports CPU, single GPU, and distributed training modes, making it versatile for different computational environments. UER-py also provides a comprehensive model zoo with pre-trained models of diverse properties, facilitating their direct use in various applications. It has been tested for reproducibility against original implementations of models like BERT, GPT-2, ELMo, and T5, and offers solutions for numerous NLP competitions.
Spell Corrector: Grammar Check
Spell Corrector: Grammar Check is an Android mobile application designed to enhance English writing by providing instant spelling and grammar correction. It identifies errors as you type, offering smart suggestions from its comprehensive English dictionary. Beyond basic corrections, the app includes a speech-to-text feature, allowing users to dictate text with perfect spelling. Its integrated keyboard ensures that corrections are available across all applications, from emails to messages and documents. The tool also enables users to save and organize their writing, making it a comprehensive solution for improving written communication and ensuring accuracy on the go.
voicebox
voicebox is an open-source voice synthesis studio that leverages Qwen3-TTS to provide a private and customizable environment for voice generation. This tool enables users to clone existing voices, generate new speech, and develop various voice-powered applications directly on their local machines. By running locally, voicebox ensures privacy and offers extensive customization options, making it suitable for developers and content creators who require fine-grained control over their audio output. Its open-source nature fosters community contributions and allows for continuous improvement and adaptation to specific user needs, providing a flexible solution for advanced voice synthesis tasks.
Currents.one
Currents AI is an AI-powered social media intelligence platform designed to transform social media strategy through real-time insights. It enables users to discover trending topics, analyze competitors, and engage effectively with their audience across all major platforms. The platform offers a real-time social search engine to uncover long-tail conversations and user pain points from forums, Reddit, reviews, and social threads. Key features include semantic social search for context-rich results, user story extraction to auto-structure comments into actionable needs, and competitor intelligence to track reactions. Currents AI also provides Murmur Lab, an AI PM Workspace, to turn unfiltered user buzz into product clarity, focusing on minority group insights and product roadmapping intelligence.
Neuro (ADHD)
Neuro (ADHD), also known as Claudia, is an AI personal assistant specifically built for adults with ADHD. It offers a voice-first interface, allowing users to speak their thoughts and have Claudia organize them into actionable tasks, reminders, and routines. The tool is designed to reduce the cognitive load associated with traditional productivity systems, which often overwhelm ADHD brains. Claudia helps with task breakdown, prioritization, scheduling, note-taking, and goal management, consolidating multiple functions into a single platform. Developed by individuals with ADHD, it aims to provide a supportive and understanding companion for managing daily life.
ModelOp
ModelOp is a leading AI lifecycle management and governance platform designed for enterprises. It provides a centralized AI system of record, enabling visibility into all internal and third-party AI solutions. The platform automates AI deployment with enforceable policies, accelerating time-to-production for ML, GenAI, Agentic AI, and vendor AI. ModelOp helps organizations control costs, ensure audit-readiness, and deliver executive insights by integrating with existing systems to orchestrate governance. It supports various industries and roles, offering solutions for AI governance, risk management, and compliance with standards like NIST AI RMF and EU AI Act.
Jarvis-Desktop-Voice-Assistant
Jarvis-Desktop-Voice-Assistant is a Python-based desktop voice assistant designed to automate daily tasks through voice commands. It integrates speech recognition and text-to-speech capabilities, allowing users to execute system-level commands, open applications and websites, perform Wikipedia and Google searches, play music, take notes, and capture screenshots. While not as intelligent as its movie namesake, it offers a range of practical functionalities for personal computer users. The project is fully completed, error-free, and built with Python 3.6+. It supports asynchronous user interactions and is open-source under an MIT license, encouraging community contributions and further development.
Meera.AI
Meera.AI is an AI texting platform designed for sales and marketing teams to automate conversations and enhance engagement with prospects and customers. It leverages human-like messages to qualify and nurture leads, schedule appointments, and manage events. The platform supports multilingual texting in over 90 languages and offers compliance controls. Meera.AI integrates with popular tools like Salesforce and HubSpot, and can warm-transfer qualified leads to agents. It aims to significantly increase connect rates and application rates by automating outreach and follow-up tasks, freeing up sales teams to focus on high-value interactions.
Dencity - Virtual Science Lab
Dencity is an AI Science Lab designed for schools and educators, offering over 330 interactive 3D experiments across Physics, Chemistry, and Biology. This tool transforms science education by allowing students to actively run experiments, change variables, and observe real-time results, fostering a deeper understanding compared to passive video learning. It supports major educational boards like CBSE, ICSE, IGCSE, Maharashtra State Board, and NIOS, with experiments mapped to specific chapters for easy integration into curricula. Dencity provides AI-powered step-by-step guidance for teachers, ensuring smooth experiment setup and clear explanations. The platform is accessible on Windows desktops, Android phones/tablets, and iOS devices, requiring no special hardware. It also includes features for homework assignments, submissions, and collaborative group experiments, creating a safe and risk-free virtual environment for scientific exploration.
SpeechKITT
SpeechKITT offers a flexible graphical user interface (GUI) designed to streamline the integration of speech recognition capabilities into websites. It provides a user-friendly interface for starting, stopping, and monitoring the status of speech recognition. SpeechKITT is compatible with different speech recognition engines, including direct webkitSpeechRecognition usage and libraries like annyang. Developers can easily guide users on voice interaction, provide instructions, and even facilitate natural conversations with follow-up questions. The tool is highly customizable, offering multiple themes and instructions for creating custom designs, making it adaptable to various web application needs.
sandbox
AIO Sandbox is a comprehensive, all-in-one agent sandbox environment designed for AI agents and developers. It integrates a browser, shell, file system, Model Context Protocol (MCP) operations, and a VSCode Server within a single Docker container. This unified setup addresses the challenges of traditional single-purpose sandboxes by offering a shared filesystem, multiple interfaces like VNC, VSCode, Jupyter, and Terminal, and secure execution for Python and Node.js. The tool is agent-ready with MCP-compatible APIs, enabling seamless integration for AI agent development and testing. It also features zero configuration, providing pre-configured MCP servers and development tools out-of-the-box.
Salvia - Tarot & Psychics
Salvia is a mobile application designed to connect users with professional psychic advisors for a variety of mystical services. Users can access tarot card readings, astrological forecasts, numerology, spiritual counseling, and other forms of divination. The platform emphasizes high-quality, professional services with advisors available 24/7. It offers diverse communication options, including quick questions, voice calls, and live sessions, allowing users to choose their preferred method. Salvia prioritizes user privacy and data security, ensuring a confidential experience. The app aims to provide guidance and clarity on personal growth, relationships, and career paths through its network of experienced psychics and spiritual healers.
Smart AI Browser & Downloader
Aarna Infotech is a leading IT solutions provider established in 2015, specializing in delivering innovative technology solutions tailored to business needs. Their comprehensive service offerings include custom software development for streamlining business processes, mobile app development for iOS and Android, and scalable cloud solutions covering migration, management, and optimization. They also provide robust cybersecurity measures to protect data and infrastructure, powerful data analytics for actionable insights, and cutting-edge AI & Machine Learning solutions to automate processes and enhance decision-making. With a team of certified professionals, Aarna Infotech aims to drive business growth and digital transformation for their clients.
ClipBERT
ClipBERT is an official PyTorch code implementation for an efficient framework designed for end-to-end learning across image-text and video-text tasks. Recognized with a CVPR 2021 Best Student Paper Honorable Mention, ClipBERT processes raw videos/images and text inputs to generate task predictions. It leverages 2D CNNs and transformers, incorporating a sparse sampling strategy to enable efficient multimodal learning. The framework supports end-to-end pretraining and finetuning for tasks such as image-text pretraining on COCO and VG captions, text-to-video retrieval on MSRVTT, DiDeMo, and ActivityNet Captions, video-QA on TGIF-QA and MSRVTT-QA, and image-QA on VQA 2.0. Its modular design allows for easy integration of additional image-text or video-text tasks.
Arintra
Arintra offers an autonomous medical coding platform designed for healthcare organizations to enhance accuracy, reduce claim denials, and accelerate payment processing. By leveraging AI and deep medical expertise, Arintra integrates seamlessly with major EHRs such as Epic and Athena, eliminating the need for workflow changes. The platform aims to unlock missed revenue, improve compliance, and reallocate staff to higher-value tasks. It boasts impressive results, including significant revenue uplift, cost savings, and reductions in denials and pre-A/R days, while maintaining high coding accuracy. Arintra supports various specialties, from Internal Medicine to Orthopedics, and offers a 90-day risk-free trial to demonstrate measurable results quickly.
hear
Hear is an AI-powered contact center intelligence platform designed for CX leaders to optimize agents, gain insights, and significantly improve customer satisfaction. The platform offers autonomous, AI-native clarity, eliminating the need for dashboards or manual work by providing proactive insights and 100% visibility. It caters to various roles including Customer Experience, Compliance, Operations, and Sales & Marketing, helping teams act on insights rather than just waiting for them. Key features include automated compliance monitoring, AI-driven insights to streamline workflows, and identification of sales opportunities. Hear integrates seamlessly with existing platforms, ensuring security and privacy are built-in, not bolted on.
Vindey
Vindey is an AI-powered platform designed to revolutionize property management for letting agents, landlords, and property management companies. It automates critical aspects of the tenancy cycle, including handling enquiries, scheduling viewings, processing applications, and managing renewals. The tool provides 24/7 maintenance coverage by triaging requests and offering instant self-help to tenants, escalating only when necessary. Vindey unifies all communications across channels like WhatsApp, email, SMS, and phone, supporting over 30 languages to ensure seamless interaction. It aims to reduce administrative burden, improve tenant satisfaction, and increase efficiency for property professionals managing portfolios of any size.
Tapway
Tapway is a no-code computer vision AI platform designed to automate visual inspection and enable real-time actions across various industries. It harnesses Vision AI to convert video feeds and images into actionable intelligence, driving business growth. The platform allows users to capture real-time visual data using existing cameras, automatically detect patterns, anomalies, and compliance issues, and instantly trigger automated alerts or actions. Tapway offers products like SamurAI for end-to-end Vision AI, VehicleTrack for automatic car plate recognition and vehicle profiling, and PeopleTrack for analyzing customer traffic and behavior. Its applications span plate number recognition, optical character recognition, fruit counting and classification, footfall tracking, PPE compliance detection, and quality inspection.
tflite_gles_app
tflite_gles_app offers GPU-accelerated deep learning inference applications, leveraging TensorFlow Lite GPU Delegate and TensorRT for enhanced performance. This open-source project is designed for platforms such as Raspberry Pi, NVIDIA Jetson, and Linux PCs. It includes a variety of applications covering tasks like lightweight and high-accuracy face detection (Blazeface, DBFace), age and gender estimation, image classification, object detection, 3D facial surface geometry estimation (Facemesh), hair segmentation, 3D handpose estimation, iris detection, 3D object detection, various pose estimations (Blazepose, Posenet), 3D human pose estimation, depth estimation, semantic segmentation, face segmentation, selfie-to-anime transformation, artistic style transfer, and text detection. The repository provides detailed instructions for building and running applications on different target environments, supporting both live camera and recorded video file inputs.
Gloabi
Gloabi introduces a truly personal and autonomous AI designed to be a digital extension of the user. This 'Super AI' possesses its own identity and email address, and continuously learns user preferences, communication style, and interests through ongoing interaction, eliminating the need for manual setup. Gloabi's self-improving AI adapts specifically to the individual, deciding when and how to enhance its capabilities. A core feature is its autonomous actions, allowing the AI to perform various tasks on the user's behalf. This includes posting to social feeds with relevant content, commenting and reacting to posts, responding to emails via its dedicated AI email address, scheduling reminders and meetings, and creating diverse media like images, videos, documents, and playlists. Gloabi also pioneers an autonomous AI-to-AI social network, where individual AIs can interact, post, comment, and converse with each other, creating a unique and dynamic digital ecosystem.