AI Agents & Automation
awesome-speech-recognition-speech-synthesis-papers
awesome-speech-recognition-speech-synthesis-papers is an open-source GitHub repository that serves as a curated list of academic papers focused on various aspects of speech technology. It covers key areas such as Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis (TTS), Language Modelling, Singing Voice Synthesis (SVS), and Voice Conversion (VC). The repository is organized by topic, making it easy for researchers, academics, and students to find relevant literature. It includes papers ranging from foundational works to recent advancements, often providing direct links to PDF versions. This resource is invaluable for anyone looking to delve into the theoretical and practical developments in speech processing.
Cydoc
Cydoc is an AI-powered scribe and triage assistant specifically developed for urgent care clinicians. This tool is designed to automate the time-consuming processes of history-taking and note writing, significantly reducing the administrative burden on healthcare professionals. By streamlining these tasks, Cydoc aims to save clinicians more than two hours per day, allowing them to focus more on patient care. The platform offers a free trial, enabling users to experience its benefits firsthand before committing. While specific features beyond automated history-taking and note writing are not detailed, the core value proposition revolves around efficiency and time-saving in a clinical setting.
Awesome-Robotics-3D
Awesome-Robotics-3D is a curated list of 3D vision papers at the intersection of robotics and large models such as Large Language Models (LLMs) and Vision-Language Models (VLMs). Inspired by awesome-computer-vision, this repository serves as a resource for researchers and academics. It groups papers into five areas: Policy Learning; Pretraining; VLM and LLM applications; Representations; and Simulations, Datasets, and Benchmarks. Each entry typically includes links to the paper, associated webpages, and code, making it easy for users to access and explore the research. The list is actively maintained and encourages contributions from the community.
ELNA.ai
ELNA.ai is positioned as an AI companion operating on the blockchain, focusing on the development, creation, and monetization of AI agents. While specific features are not detailed on the homepage, the platform's core identity revolves around leveraging blockchain technology for AI agent infrastructure. It aims to provide a decentralized environment for users to interact with and potentially build AI agents, suggesting a focus on the underlying framework and infrastructure for AI within a decentralized ecosystem. The platform's emphasis on blockchain implies aspects like transparency, security, and potentially tokenization or decentralized governance for AI agents.
HostBuddy AI
HostBuddy AI is designed to streamline guest communication for short-term rental hosts. This AI-powered tool automates responses to guest messages, addressing common inquiries and troubleshooting issues efficiently. It integrates with property management systems to provide seamless, automated communication workflows. HostBuddy AI offers valuable features such as key communication metrics, intelligent templated messaging, and the ability to track guest-reported issues, helping hosts maintain high guest satisfaction and operational efficiency. By automating routine interactions, HostBuddy AI allows hosts to focus on other aspects of their business while ensuring guests receive timely and accurate information.
factorie.io
factorie.io, led by Urvesh Goel, is a business consulting service focused on democratizing manufacturing and fostering growth. The platform offers business consulting services and features articles written by Urvesh Goel on a range of topics including execution, understanding emotions, and digital trends like Artificial Intelligence. These articles delve into subjects such as the impact of AI on jobs, decoding intelligence, and intelligent customer engagement. The site also showcases Urvesh Goel's professional experience and education, highlighting his background in digital entrepreneurship and his academic pursuits at the Indian Institute of Technology, Roorkee. The content suggests a focus on strategic insights and practical applications for business leaders.
marlin
Marlin is an extremely optimized FP16xINT4 matrix multiplication kernel specifically designed for Large Language Model (LLM) inference. It aims to deliver close to ideal (4x) speedups for batch sizes up to 16-32 tokens, significantly outperforming prior work that typically achieves comparable speedups only at 1-2 tokens. This makes Marlin particularly well-suited for larger-scale serving, speculative decoding, and advanced multi-inference schemes like CoT-Majority. The kernel employs numerous techniques and optimizations, including organizing computation for efficient L2 cache usage, asynchronous global weight loads, double buffering for shared memory loads, and careful ordering of dequantization and tensor core instructions. It also reshuffles quantized weights and group scales offline for ideal access patterns and uses a "striped" partitioning scheme for good SM utilization across various matrix shapes. Marlin requires CUDA >= 11.8, an NVIDIA GPU with compute capability >= 8.0 (Ampere or Ada), and torch>=2.0.0.
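To illustrate what an FP16xINT4 product involves, here is a minimal pure-Python sketch: weights are quantized to signed 4-bit integers with one scale per group, packed two per byte, and dequantized on the fly during a dot product with floating-point activations. The function names, the symmetric quantization scheme, and the group size are illustrative assumptions. This is not Marlin's API or its CUDA kernel, and it shows none of the kernel's optimizations.

```python
# Illustrative sketch only: hypothetical helper names, not Marlin's API.

def quantize_int4(weights, group_size):
    """Quantize floats to signed 4-bit ints, with one scale per group."""
    q, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # Assumed symmetric scheme: map the largest magnitude to +/-7.
        scale = max_abs / 7.0 if max_abs > 0 else 1.0
        scales.append(scale)
        q.extend(max(-8, min(7, round(w / scale))) for w in group)
    return q, scales


def pack_int4(q):
    """Pack pairs of 4-bit values into bytes, low nibble first."""
    if len(q) % 2:
        q = q + [0]  # pad to an even count
    return bytes(
        (q[i] & 0xF) | ((q[i + 1] & 0xF) << 4) for i in range(0, len(q), 2)
    )


def unpack_int4(packed, count):
    """Recover signed 4-bit values from packed bytes."""
    out = []
    for byte in packed:
        for nibble in (byte & 0xF, byte >> 4):
            out.append(nibble - 16 if nibble >= 8 else nibble)
    return out[:count]


def dequant_dot(activations, packed, scales, group_size):
    """Dot product of float activations with dequantized INT4 weights."""
    q = unpack_int4(packed, len(activations))
    return sum(
        a * q_i * scales[i // group_size]
        for i, (a, q_i) in enumerate(zip(activations, q))
    )


weights = [0.7, -0.35, 0.0, 0.14, 1.4, -1.4, 0.7, 0.2]
q, scales = quantize_int4(weights, group_size=4)
packed = pack_int4(q)  # 8 weights stored in 4 bytes
approx = dequant_dot([1.0] * 8, packed, scales, group_size=4)
```

Storing each weight in 4 bits rather than 16 cuts weight memory traffic by about 4x, which is the basis for the "ideal (4x)" figure when inference is limited by memory bandwidth.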
Sahana System Limited
Sahana System Limited is an IT company based in India, specializing in a wide range of technology solutions including AI, cloud, and enterprise services. They provide expertise in areas such as Generative AI, AI and ML, Big Data Analytics, DevOps, CloudOps, MLOps, Digital Product Engineering, ERP, IoT, Cyber Security, Microsoft solutions, Blockchain, and Embedded Engineering. The company holds CMMI Level-5, ISO 9001, and ISO/IEC 27001 certifications. Sahana System aims to empower businesses with intelligent solutions, focusing on faster time-to-market, lower costs, and higher ROI across industries such as Fintech, Healthcare, Defence, and Manufacturing.
interneuron
Interneuron is an AI consultancy focused on delivering custom artificial intelligence solutions to modern enterprises. The company specializes in integrating AI into existing business operations, offering expertise in the development and implementation of comprehensive AI strategies. Their services are designed to help businesses leverage AI for improved efficiency, decision-making, and innovation. Interneuron aims to bridge the gap between complex AI technologies and practical business applications, ensuring clients can effectively adopt and benefit from advanced AI capabilities tailored to their specific needs.
Awesome-Korean-NLP
Awesome-Korean-NLP is a comprehensive, curated list of resources dedicated to Natural Language Processing (NLP) for the Korean language. This GitHub repository serves as a central hub for various tools, datasets, blogs, research papers, lectures, and online communities relevant to Korean NLP. It includes specific sections for morpheme/PoS taggers, named entity taggers, spell checkers, syntax parsers, sentiment analysis tools, translators, and general NLP packages. The resource also lists significant Korean datasets like Sejong Corpus and Wikipedia Dump, alongside academic papers and lectures from prominent institutions. It's an invaluable resource for anyone working with Korean language data, from academic researchers to developers building Korean NLP applications.
awesome-langchain
awesome-langchain is a curated list of tools and projects that leverage the LangChain framework, designed to assist developers in building applications with Large Language Models (LLMs). The repository serves as a dynamic tracker for initiatives within the LangChain ecosystem, which is expanding at a rapid pace. It provides a comprehensive overview of various components, including LangChain framework ports to other languages, low-code tools, services, agents, templates, and open-source projects for knowledge management and chatbots. This resource is invaluable for developers looking to explore, implement, and stay informed about the latest advancements and tools available for LangChain.
awesome-llm-agents
awesome-llm-agents is a comprehensive, curated list of open-source LLM agent frameworks and development tools designed to assist developers in building sophisticated AI agents. The repository features a wide array of frameworks, each detailed with its key characteristics, such as multi-agent collaboration, modular architecture, data analysis capabilities, and integration with various LLM providers. It includes popular tools like CrewAI, LangChain, Microsoft AutoGen, and LlamaIndex, alongside specialized frameworks for areas like software development (MetaGPT), scientific discovery (GenoMAS), and robotics (RAI). The list is regularly updated and serves as a valuable resource for anyone looking to explore or implement LLM agent technologies, offering insights into different approaches to agent design, workflow orchestration, and tool integration.
awesome-online-machine-learning
awesome-online-machine-learning is a comprehensive, open-source curated list of resources dedicated to online machine learning. This field focuses on machine learning where data arrives sequentially, allowing models to update incrementally with one data point at a time, contrasting with traditional batch learning. The repository provides valuable links to courses, books, blog posts, and software related to online ML. It also features an extensive collection of research papers covering various online learning topics such as linear models, support vector machines, neural networks, decision trees, unsupervised learning, time series analysis, drift detection, and anomaly detection. This resource is ideal for anyone looking to deepen their understanding or find tools for online machine learning.
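As a minimal sketch of the incremental setting described above, the following pure-Python example fits a linear model by stochastic gradient descent, seeing one data point at a time and never storing the stream. The class name, method names, learning rate, and synthetic data are illustrative assumptions, not the API of any library listed in the repository.

```python
import random


class OnlineLinearRegression:
    """Hypothetical online learner: one SGD step per observed example."""

    def __init__(self, n_features, learning_rate=0.05):
        self.weights = [0.0] * n_features
        self.bias = 0.0
        self.learning_rate = learning_rate

    def predict_one(self, x):
        return self.bias + sum(w * xi for w, xi in zip(self.weights, x))

    def learn_one(self, x, y):
        # Gradient of the squared error 0.5 * (prediction - y)^2.
        error = self.predict_one(x) - y
        self.weights = [
            w - self.learning_rate * error * xi
            for w, xi in zip(self.weights, x)
        ]
        self.bias -= self.learning_rate * error
        return self


def stream(n, seed=0):
    """Synthetic, noise-free stream: y = 2*x0 - 1*x1 + 0.5."""
    rng = random.Random(seed)
    for _ in range(n):
        x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        yield x, 2.0 * x[0] - 1.0 * x[1] + 0.5


model = OnlineLinearRegression(n_features=2)
for x, y in stream(5000):
    model.learn_one(x, y)  # update immediately, then discard the example
```

In contrast to batch learning, the model is usable after every update and memory use stays constant regardless of how long the stream runs.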
Xpress AI
Xpress AI offers an enterprise operating system for AI agents, transforming AI potential into measurable results by deploying managed digital workforces. It addresses common pain points of AI agent deployment, such as complex setup, reliability issues, and lack of trust. The platform enables users to name agents, assign roles, and have them perform tasks like SDRs, content managers, or DevOps engineers, without requiring extensive technical knowledge or dedicated hardware. Xpress AI features isolated container environments for safety, persistent memory systems for agents, and platform-level integrations for seamless workflow. It also provides XpressCLAW, a free, open-source agent runtime for local deployment.
Kibsi
Kibsi is a powerful computer vision platform designed to democratize AI and computer vision capabilities for various industries. It enables users to build and deploy computer vision solutions rapidly, leveraging existing cameras to generate real-time insights. The platform offers thousands of built-in detectors and allows for custom models, supporting deployment in the cloud, at the edge, or both. Kibsi helps enhance efficiency, improve safety, and reduce operational costs by transforming passive video feeds into actionable data. It provides instant app solutions for specific use cases like forklift safety, quality inspection, and production line monitoring, making it a versatile tool for optimizing operations across manufacturing, supply chain, transportation, and more.
HitoAI Limited
HitoAI Limited is an AI firm dedicated to crafting, developing, and implementing advanced AI technologies across various business sectors. The company's core mission is to revolutionize business operations by delivering innovative AI solutions designed to enhance efficiency, boost productivity, and foster growth. While specific features are not detailed on the current website, the company's focus is on providing comprehensive AI frameworks and infrastructure to support diverse business needs. This suggests a commitment to foundational AI development and deployment, rather than end-user applications.
Flexday AI
Flexday AI provides an Enterprise AI solution that utilizes autonomous AI agents to orchestrate, analyze, and submit mission-critical insights from your data ecosystem. It offers unified data access, connecting to enterprise data sources that include structured data, unstructured data, and AI models. The platform facilitates enterprise process flow, delivering the right information to internal teams and external users when needed, with omnichannel accessibility. Flexday AI solutions benefit various teams, including procurement, customer service, human resources, legal, sales & marketing, IT service delivery, supply chain logistics, and education. The platform emphasizes data security with features like encryption, access controls, regular security audits, and secure cloud infrastructure.
Forage Mail
Forage Mail is an AI-powered email management solution designed to combat email overwhelm by intelligently filtering and organizing your inbox. It operates invisibly within Gmail, identifying priority messages from real humans and time-sensitive alerts, while sweeping less important emails into a concise daily summary. Key features include automatic email labeling, bulk unsubscribing from unwanted senders, and the ability to clean out thousands of unread emails with its Deep Clean function. Forage also provides bullet-point summaries of newsletters, allowing users to quickly grasp key information. The tool aims to save users 1-2 hours daily by streamlining their email workflow and ensuring they only focus on messages that truly matter, all while respecting user privacy by not processing personal messages with AI unless opted in.
U-KAN
U-KAN is an official PyTorch implementation designed to serve as a robust backbone for medical image segmentation and generation. It integrates Kolmogorov-Arnold Network (KAN) layers into the established U-Net pipeline. This integration, termed U-KAN, aims to enhance accuracy and efficiency in medical imaging tasks. The tool has demonstrated superiority in rigorous medical image segmentation benchmarks, achieving higher accuracy with reduced computational cost. Furthermore, U-KAN explores its potential as an alternative U-Net noise predictor in diffusion models, showcasing its applicability in generating task-oriented model architectures. It is the first effort to incorporate KAN's advantages into the U-Net pipeline, offering a more accurate, efficient, and interpretable solution for vision tasks.
Kisui (輝翠)
Kisui (輝翠) is an innovative company developing AI robotic services specifically designed for small to medium-sized family farmers. Their flagship product, Adam, is an autonomous AI robot engineered to revolutionize agricultural practices. Adam, along with its smaller counterpart Mini Adam, can operate without human intervention, directly addressing critical issues like labor shortages and an aging workforce in the agricultural sector. The platform also includes MyNojo, an agricultural DX (Digital Transformation) foundation that helps make farms smarter. Kisui offers various attachments for Adam, enabling it to perform diverse tasks, making it a versatile solution for modern farming needs.
AI Bookmarker
AI Bookmarker is an AI-powered browser extension designed to optimize bookmark management. It automatically generates tags and summaries for saved web pages, supporting multiple AI models for enhanced functionality. Users can effortlessly organize and retrieve online content through powerful full-text search capabilities. The tool offers seamless integration with Notion for syncing bookmarks and the ability to save original web content, including videos from platforms like X (Twitter), as Markdown. Additionally, it provides one-click synchronization to NotebookLM, enabling users to build a personal knowledge base. With cloud backup and encrypted data, AI Bookmarker ensures both accessibility and privacy for your digital resources.
Fusion AI
Fusion AI simplifies the complex AI landscape by integrating top-tier AI models from OpenAI, Anthropic, and Google, including o4-mini, GPT-5, Sonnet 4, and Gemini 2.5 Pro, into one unified platform. This ensemble approach aims to give users better results by drawing on the strengths of each model. The platform is designed for ease of use: users tell Fusion AI what they need, and it assembles a team of AI models to collaborate and deliver refined solutions. Fusion AI operates on a usage-based credit system, eliminating subscriptions and hidden fees, making it a flexible and transparent solution for both small businesses and large enterprises.
GoalMentorAI
GoalMentorAI is an AI-powered platform designed to help users achieve their goals by breaking down ambitions into manageable daily tasks. It offers personalized AI guidance, creating custom plans tailored to individual needs. The tool provides motivational support and progress tracking, making goal-setting effortless across diverse areas such as language learning, career growth, financial planning, fitness, and business launches. GoalMentorAI acts as a 24/7 AI strategist, ensuring users stay on track and receive the necessary support to turn their aspirations into reality, whether they are learning a new skill or starting a business.
Tenalog
Tenalog is an AI-powered documentation system designed for therapists, including Speech-Language Pathologists (SLPs), Occupational Therapists (OTs), and Physical Therapists (PTs). It revolutionizes clinical documentation by automatically generating detailed session transcripts, in-depth SOAP notes, and automated progress tracking. The tool also provides analysis of articulation errors down to the phoneme level and creates parent-friendly summaries of progress. Tenalog aims to free up therapists to focus on patient care by capturing session nuances without tedious manual note-taking, helping to avoid clinician burnout and achieve better outcomes. It supports audio and video file uploads, and is HIPAA compliant.