What Integrates with OpenRouter?
Find out what OpenRouter integrations exist in 2026. Learn what software and services currently integrate with OpenRouter, and sort them by reviews, cost, features, and more. Below is a list of products that OpenRouter currently integrates with:
-
1
GLM-4.6
Zhipu AI
FreeGLM-4.6 builds upon the foundations laid by its predecessor, showcasing enhanced reasoning, coding, and agent capabilities, resulting in notable advancements in inferential accuracy, improved tool usage during reasoning tasks, and a more seamless integration within agent frameworks. In comprehensive benchmark evaluations that assess reasoning, coding, and agent performance, GLM-4.6 surpasses GLM-4.5 and competes robustly against other models like DeepSeek-V3.2-Exp and Claude Sonnet 4, although it still lags behind Claude Sonnet 4.5 in terms of coding capabilities. Furthermore, when subjected to practical tests utilizing an extensive “CC-Bench” suite that includes tasks in front-end development, tool creation, data analysis, and algorithmic challenges, GLM-4.6 outperforms GLM-4.5 while nearing parity with Claude Sonnet 4, achieving victory in approximately 48.6% of direct comparisons and demonstrating around 15% improved token efficiency. This latest model is accessible through the Z.ai API, providing developers the flexibility to implement it as either an LLM backend or as the core of an agent within the platform's API ecosystem. In addition, its advancements could significantly enhance productivity in various application domains, making it an attractive option for developers looking to leverage cutting-edge AI technology. -
2
Gemini Enterprise
Google
$21 per monthGemini Enterprise app is a comprehensive agentic AI platform designed to improve productivity and collaboration across organizations. It enables users to connect various workplace tools and data sources, providing a unified environment for searching, analyzing, and generating content. The platform supports multi-step automation through AI agents that can perform tasks across different applications without manual intervention. Users can leverage prebuilt Google agents or create custom agents using a no-code interface, making AI accessible to both technical and non-technical teams. Gemini Enterprise app also offers centralized control over data access, permissions, and workflows, ensuring secure and compliant operations. It is suitable for various departments, including marketing, sales, engineering, HR, and finance. By grounding AI outputs in enterprise data, it delivers more accurate and relevant results. Overall, it helps organizations operate more efficiently and make data-driven decisions. -
3
Novelcrafter
Novelcrafter
$4.64 per monthNovelcrafter is an innovative writing platform powered by AI, designed to assist authors at every stage of their storytelling journey, encompassing everything from brainstorming ideas and developing characters to drafting, reviewing, and finalizing their written works. The platform features a specialized “Codex” wiki that allows writers to organize essential elements such as characters, settings, lore, and world-building details, promoting consistency and ease of access. Additionally, it provides various structured planning modes that include acts, chapters, and scenes, enabling authors to transition effortlessly between the planning phase and the writing interface. Authors have the flexibility to utilize customizable AI tools, allowing them to link their own API keys (such as OpenAI, Claude, or local LLMs) and create specific prompts, or they can choose to write manually without any AI assistance. Furthermore, Novelcrafter boasts a distraction-free writing mode, keeps a revision history, supports the import and export of documents in formats like Word, Markdown, and HTML, and offers mobile compatibility for writers who need to jot down ideas while on the move. This platform seeks to empower writers by providing a comprehensive suite of tools tailored to enhance creativity and streamline the writing process. -
4
Scraib
Scraib
$3.99 per monthScraib.app is a macOS writing assistant powered by AI that resides in the menu bar, allowing users to select text from any application and improve it by pressing Control + R, which enhances grammar, clarity, and style. Users have the flexibility to set custom rules to align with their preferred tone, and unlike other writing software that requires switching between applications, Scraib seamlessly integrates with various platforms, including Slack, Outlook, Pages, Word, Chrome, and Figma. It prioritizes user privacy by offering options to work with different AI providers like ChatGPT, Claude, and others, while also allowing for local operation with supported models, ensuring that sensitive data remains secure. Designed for efficiency, it minimizes workflow interruptions, enabling users to refine their text without leaving their current application, making it an ideal tool for enhancing written communication on the fly. Additionally, Scraib's intuitive shortcut-based system enhances productivity, allowing for quick adjustments and refinements directly where the text exists. -
5
Apollo
Liquid AI
FreeApollo is a streamlined mobile application that facilitates completely on-device, cloud-independent AI interactions, allowing users to interact with sophisticated language and vision models in a secure, private manner with minimal delays. It features a collection of compact foundation models sourced from the company's LEAP platform, enabling users to compose messages, send emails, converse with a personal AI assistant, create digital characters, or utilize image-to-text functions, all while maintaining offline capabilities and ensuring no data is transmitted beyond the device. Optimized for immediate responsiveness and offline functionality, Apollo guarantees that all inference occurs locally, eliminating the need for API calls, external servers, or logging of user data. This application acts as both a personal AI exploration tool and a development environment for those utilizing LEAP models, allowing users to effectively assess a model's performance on their specific mobile devices prior to more widespread implementation. Additionally, Apollo's design emphasizes user autonomy, ensuring a seamless experience free from external interruptions or privacy concerns. -
6
GLM-4.6V
Zhipu AI
FreeThe GLM-4.6V is an advanced, open-source multimodal vision-language model that belongs to the Z.ai (GLM-V) family, specifically engineered for tasks involving reasoning, perception, and action. It is available in two configurations: a comprehensive version with 106 billion parameters suitable for cloud environments or high-performance computing clusters, and a streamlined “Flash” variant featuring 9 billion parameters, which is tailored for local implementation or scenarios requiring low latency. With a remarkable native context window that accommodates up to 128,000 tokens during its training phase, GLM-4.6V can effectively manage extensive documents or multimodal data inputs. One of its standout features is the built-in Function Calling capability, allowing the model to accept various forms of visual media — such as images, screenshots, and documents — as inputs directly, eliminating the need for manual text conversion. This functionality not only facilitates reasoning about the visual content but also enables the model to initiate tool calls, effectively merging visual perception with actionable results. The versatility of GLM-4.6V opens the door to a wide array of applications, including the generation of interleaved image-and-text content, which can seamlessly integrate document comprehension with text summarization or the creation of responses that include image annotations, thereby greatly enhancing user interaction and output quality. -
7
GLM-4.1V
Zhipu AI
FreeGLM-4.1V is an advanced vision-language model that offers a robust and streamlined multimodal capability for reasoning and understanding across various forms of media, including images, text, and documents. The 9-billion-parameter version, known as GLM-4.1V-9B-Thinking, is developed on the foundation of GLM-4-9B and has been improved through a unique training approach that employs Reinforcement Learning with Curriculum Sampling (RLCS). This model accommodates a context window of 64k tokens and can process high-resolution inputs, supporting images up to 4K resolution with any aspect ratio, which allows it to tackle intricate tasks such as optical character recognition, image captioning, chart and document parsing, video analysis, scene comprehension, and GUI-agent workflows, including the interpretation of screenshots and recognition of UI elements. In benchmark tests conducted at the 10 B-parameter scale, GLM-4.1V-9B-Thinking demonstrated exceptional capabilities, achieving the highest performance on 23 out of 28 evaluated tasks. Its advancements signify a substantial leap forward in the integration of visual and textual data, setting a new standard for multimodal models in various applications. -
8
GLM-4.5V-Flash
Zhipu AI
FreeGLM-4.5V-Flash is a vision-language model that is open source and specifically crafted to integrate robust multimodal functionalities into a compact and easily deployable framework. It accommodates various types of inputs including images, videos, documents, and graphical user interfaces, facilitating a range of tasks such as understanding scenes, parsing charts and documents, reading screens, and analyzing multiple images. In contrast to its larger counterparts, GLM-4.5V-Flash maintains a smaller footprint while still embodying essential visual language model features such as visual reasoning, video comprehension, handling GUI tasks, and parsing complex documents. This model can be utilized within “GUI agent” workflows, allowing it to interpret screenshots or desktop captures, identify icons or UI components, and assist with both automated desktop and web tasks. While it may not achieve the performance enhancements seen in the largest models, GLM-4.5V-Flash is highly adaptable for practical multimodal applications where efficiency, reduced resource requirements, and extensive modality support are key considerations. Its design ensures that users can harness powerful functionalities without sacrificing speed or accessibility. -
9
GLM-4.5V
Zhipu AI
FreeGLM-4.5V is an evolution of the GLM-4.5-Air model, incorporating a Mixture-of-Experts (MoE) framework that boasts a remarkable total of 106 billion parameters, with 12 billion specifically dedicated to activation. This model stands out by delivering top-tier performance among open-source vision-language models (VLMs) of comparable scale, demonstrating exceptional capabilities across 42 public benchmarks in diverse contexts such as images, videos, documents, and GUI interactions. It offers an extensive array of multimodal functionalities, encompassing image reasoning tasks like scene understanding, spatial recognition, and multi-image analysis, alongside video comprehension tasks that include segmentation and event recognition. Furthermore, it excels in parsing complex charts and lengthy documents, facilitating GUI-agent workflows through tasks like screen reading and desktop automation, while also providing accurate visual grounding by locating objects and generating bounding boxes. Additionally, the introduction of a "Thinking Mode" switch enhances user experience by allowing the selection of either rapid responses or more thoughtful reasoning based on the situation at hand. This innovative feature makes GLM-4.5V not only versatile but also adaptable to various user needs. -
10
GLM-4.7
Zhipu AI
FreeGLM-4.7 is a next-generation AI model built to serve as a powerful coding and reasoning partner. It improves significantly on its predecessor across software engineering, multilingual coding, and terminal interaction benchmarks. GLM-4.7 introduces enhanced agentic behavior by thinking before tool use or execution, improving reliability in long and complex tasks. The model demonstrates strong performance in real-world coding environments and popular coding agents. GLM-4.7 also advances visual and frontend generation, producing modern UI designs and well-structured presentation slides. Its improved tool-use capabilities allow it to browse, analyze, and interact with external systems more effectively. Mathematical and logical reasoning have been strengthened through higher benchmark performance on challenging exams. The model supports flexible reasoning modes, allowing users to trade latency for accuracy. GLM-4.7 can be accessed via Z.ai, OpenRouter, and agent-based coding tools. It is designed for developers who need high performance without excessive cost. -
11
Repo Prompt
Repo Prompt
$14.99 per monthRepo Prompt is an AI coding assistant designed specifically for macOS, which serves as a context engineering tool that empowers developers to interact with and refine codebases through the use of large language models. By enabling users to select particular files or directories, it allows for the creation of structured prompts that contain only the most relevant context, thereby facilitating the review and application of AI-generated code alterations as diffs instead of requiring rewrites of entire files, which ensures meticulous and traceable modifications. Additionally, it features a visual file explorer for efficient project navigation, an intelligent context builder, and CodeMaps that minimize token usage while enhancing the models' comprehension of project structures. Users benefit from multi-model support, enabling them to utilize their own API keys from various providers such as OpenAI, Anthropic, Gemini, and Azure, ensuring that all processing remains local and private unless the user chooses to send code to a language model. Repo Prompt is versatile, functioning as both an independent chat/workflow interface and as an MCP (Model Context Protocol) server, allowing for seamless integration with AI editors, making it an essential tool in modern software development. Overall, its robust features significantly streamline the coding process while maintaining a strong emphasis on user control and privacy. -
12
Knolli
Knolli
$39 per monthKnolli serves as an AI copilot platform that allows users to create, deploy, and expand tailored AI copilots and agents without the necessity of coding by converting knowledge, documents, datasets, and proprietary materials into engaging, conversational assistants. This platform features a no-code workspace where individuals, teams, and businesses can articulate their concepts in simple terms, enabling Knolli to automatically organize uploaded materials into a functional AI copilot. Additionally, it ensures data is organized and safeguarded through encrypted private knowledge bases while seamlessly integrating with tools like CRMs, file storage systems, and databases to provide real-time data for contextually relevant interactions. Knolli accommodates a multi-agent framework that allows various specialized agents to operate within a single copilot, offers pre-designed templates for frequent scenarios, and supports custom branding and white-label solutions. Users can also benefit from comprehensive analytics to track performance, usage metrics, and return on investment. Moreover, Knolli enhances productivity by providing workflow automation, which empowers copilots to carry out complex tasks and synchronize with current systems effortlessly. This robust set of features makes Knolli a versatile solution for organizations looking to leverage AI effectively. -
13
TexTab
TexTab
FreeTexTab is a productivity application designed for macOS that empowers users to convert AI-driven tasks into instant keyboard shortcuts, facilitating efficient text processing and automation without the need to switch between different applications. It functions at the system level, allowing users to highlight text in any macOS program, such as web browsers, email clients, coding environments, and documents, and execute AI actions with just one keystroke, streamlining tasks like translation, summarization, rewriting, or formalization into easily accessible commands. Users have the flexibility to create an unlimited number of customized AI actions, each with its distinct shortcut, and can connect to various AI providers—including OpenAI, Anthropic, Groq, Perplexity, or OpenRouter—using their personal API keys, ensuring that their data remains confidential and expenses are managed effectively; the API requests are sent directly to the provider without routing through TexTab’s servers. Additionally, the application boasts features such as a one-click AI prompt enhancer, built-in plugins like a pop-up AI chat, a QR code generator, an image converter, and a color picker, all designed to enhance user experience and productivity. This comprehensive suite of tools makes TexTab an invaluable asset for anyone looking to leverage AI capabilities seamlessly within their workflow. -
14
Agent Zero
Agent Zero
$2.65 per monthAgent Zero is an innovative open source framework for AI agents that enables the development of autonomous assistants capable of executing intricate tasks through direct interaction with computer systems. This platform offers a unique setting where AI agents can access real system functions, empowering them to run commands, write and execute code, navigate the internet, analyze data, and oversee workflows as part of comprehensive automation solutions. Unlike a standard chat interface, Agent Zero operates within its isolated virtual environment, enabling it to engage with the operating system, install necessary tools, run scripts, and manage tasks across various components seamlessly. The framework prioritizes transparency and developer control, allowing users to monitor, adjust, and personalize agent behavior, tool accessibility, and information processing methods. With a modular architecture, Agent Zero facilitates the dynamic creation and utilization of tools, all while maintaining a consistent memory for enhanced performance. This makes it an ideal choice for developers aiming to build highly customizable and efficient AI-driven workflows. -
15
AI SpendOps
AI SpendOps
£199We provide a unified platform for engineering, finance, and FinOps teams to monitor, allocate, and enhance spending on LLM APIs from various providers. Expenses are categorized based on customizable dimensions that align with your organization's financial reporting practices. Engineering teams experience seamless cost monitoring that doesn't impede their workflow. CTOs benefit from a consolidated view that facilitates model governance and mitigates unauthorized usage. CFOs receive high-quality financial reports for accurate forecasting, budgeting, and chargebacks, all tailored to their specific reporting frameworks. FinOps teams have access to real-time cost information across multiple providers, integrating effortlessly into their existing cloud management processes. When your organization utilizes LLM APIs and the board inquires about spending and its justification, we serve as the definitive solution to those questions. Furthermore, our platform empowers teams to make informed financial decisions, increasing accountability and optimizing resource allocation. -
16
Hermes Agent
Nous Research
FreeHermes Agent by Nous Research is an open-source autonomous AI system designed to function as a persistent, self-improving digital assistant. Unlike traditional AI tools, it runs locally on your server, allowing it to retain memory and learn from ongoing interactions over time. The agent integrates with multiple communication platforms, including Slack, Discord, Telegram, and WhatsApp, enabling seamless cross-platform usage. Hermes supports automation through natural language scheduling, allowing users to set up recurring tasks such as reports, backups, and updates. It can execute complex workflows by delegating tasks to subagents that operate in parallel environments. The platform includes advanced capabilities like web browsing, search, code execution, and multimedia processing. It also offers secure sandboxing options across multiple environments, including Docker and local systems. Hermes can be configured to use different AI models or APIs, giving users flexibility in deployment. Its command-line interface provides a powerful and customizable interaction experience. Overall, Hermes Agent delivers a scalable, adaptable, and intelligent solution for automation and task management. -
17
Octrafic
Octrafic
FreeOctrafic is a command-line tool that leverages AI and is available as open source, aimed at simplifying the process of automated API testing and exploration by allowing users to communicate with APIs in natural language rather than having to write complex scripts or set up intricate testing frameworks. By simply directing the tool to any HTTP API or OpenAPI specification, users can articulate their testing requirements in straightforward English, prompting the integrated AI agent to create test scenarios, perform actual HTTP requests, verify responses, and generate organized results. This tool streamlines the entire testing process, encompassing endpoint discovery, request formulation, schema checks, and error identification, which enables developers to prioritize testing logic without getting bogged down by the underlying implementation specifics. Additionally, it accommodates real-time execution against live APIs, ensuring the accuracy of status codes and behaviors without the need for mock setups, and it can also produce aesthetically formatted PDF reports for effective communication with teams or stakeholders. With its user-friendly approach, Octrafic represents a significant advancement in making API testing more accessible and efficient. -
18
Trinity-Large-Thinking
Arcee AI
FreeTrinity Large Thinking is an innovative open-source reasoning model crafted by Arcee AI, tailored for intricate, multi-step problem solving and workflows involving autonomous agents that necessitate extended planning and the use of various tools. This model features a sparse Mixture-of-Experts architecture, boasting a remarkable total of around 400 billion parameters, with approximately 13 billion being active for each token, which enhances its efficiency while ensuring robust reasoning capabilities across a range of tasks, including mathematical calculations, code generation, and comprehensive analysis. A notable advancement in this model is its ability to perform extended chain-of-thought reasoning, which allows it to produce intermediate "thinking traces" prior to delivering final solutions, thereby boosting accuracy and reliability in complex situations. Furthermore, Trinity Large Thinking accommodates a substantial context window of up to 262K tokens, allowing it to effectively process lengthy documents, retain context during prolonged interactions, and function seamlessly in continuous agent loops. This model's design reflects a commitment to pushing the boundaries of what automated reasoning systems can achieve. -
19
GPT-5.5
OpenAI
$5 per 1M tokens (input)GPT-5.5 is a next-generation AI system built for execution-heavy workflows across coding, research, business analysis, and scientific tasks. It can interpret complex instructions, break them into actionable steps, and carry them through to completion while interacting with tools and systems. The model supports creating applications, generating reports, analyzing datasets, and navigating software environments seamlessly. It also integrates with workspace agents—custom AI agents that automate recurring and multi-step processes across teams. These agents can handle tasks such as lead research, reporting, and workflow automation, either on demand or on schedules. GPT-5.5 enhances productivity by reducing manual effort and enabling continuous task execution across tools. With enterprise-grade safeguards and monitoring, it ensures secure and controlled automation. It is well-suited for organizations looking to scale operations and improve efficiency through AI-driven workflows. -
20
GPT-5.5 Pro
OpenAI
$30 per 1M tokens (input)GPT-5.5 Pro is a next-generation AI model built for execution-heavy tasks across coding, research, business analysis, and scientific workflows. It can interpret complex instructions, break them into steps, and carry work through to completion using tools and automation. The model supports tasks such as generating documents, building applications, analyzing datasets, and navigating software environments. It is designed to operate across tools, enabling seamless workflows from idea to output. In addition, GPT-5.5 Pro integrates with workspace agents—customizable AI agents that automate recurring and multi-step processes across teams. These agents can handle tasks like lead research, reporting, and workflow automation, running independently or on schedules. Built with enterprise-grade safeguards, the model ensures secure and controlled automation. It helps organizations improve productivity by reducing manual effort and accelerating decision-making. GPT-5.5 Pro is ideal for teams looking to scale operations and handle complex workloads efficiently. -
21
Sourcebot
Sourcebot
FreeSourcebot is a self-hosted platform for code comprehension that assists developers and AI agents in searching, navigating, and reasoning through extensive codebases, regardless of their size. It allows teams to index repositories from various sources, including GitHub, GitLab, and Bitbucket, enabling exploration through a cohesive interface that offers quick, multi-repository searches with sophisticated filtering options, regex capabilities, and queries tailored to specific programming languages. The platform features an "ask mode" that lets users ask questions in everyday language, while an integrated language model scours the indexed code, tracks references, and provides structured responses with inline citations linked to the corresponding code snippets. Beyond search functionalities, Sourcebot enhances the development experience by incorporating IDE-level navigation tools such as go-to-definition and find-references across all repositories, as well as a built-in file explorer that showcases syntax highlighting and complete visibility of code. This comprehensive set of features empowers developers to work more efficiently and gain deeper insights into their projects. -
22
OllaCoder
OllaCoder
FreeOllaCoder serves as a private AI coding assistant tailored for VS Code, catering specifically to developers who prefer not to upload their source code to external servers. Operating locally, it utilizes your personal Ollama models and integrates features such as agent mode, inline edits, codebase chat, intelligent autocomplete, MCP servers, and a local-first runtime all within a single editor interface. The core philosophy behind OllaCoder emphasizes the notion that software development is a personal endeavor, asserting that your code should remain under your control while providing an AI assistant that is robust, transparent, and unobtrusive. It primarily communicates with your local Ollama instance, ensuring that prompts, completions, and modifications remain on your device; cloud services are optional, with API keys securely stored in the OS keychain. OllaCoder's agent mode is capable of planning tasks, modifying files, executing terminal commands, and confirming the accuracy of its work, allowing users to approve, reject, or revert any action taken. Additionally, the inline edits feature enables users to select a function, specify the desired change, and examine a real diff change by change, enhancing the coding experience. Overall, OllaCoder represents a significant step forward in maintaining code privacy while providing powerful AI-assisted development tools. -
23
GPT-5
OpenAI
$1.25 per 1M tokensOpenAI’s GPT-5 represents the cutting edge in AI language models, designed to be smarter, faster, and more reliable across diverse applications such as legal analysis, scientific research, and financial modeling. This flagship model incorporates built-in “thinking” to deliver accurate, professional, and nuanced responses that help users solve complex problems. With a massive context window and high token output limits, GPT-5 supports extensive conversations and intricate coding tasks with minimal prompting. It introduces advanced features like the verbosity parameter, enabling users to control the detail and tone of generated content. GPT-5 also integrates seamlessly with enterprise data sources like Google Drive and SharePoint, enhancing response relevance with company-specific knowledge while ensuring data privacy. The model’s improved personality and steerability make it adaptable for a wide range of business needs. Available in ChatGPT and API platforms, GPT-5 brings expert intelligence to every user, from casual individuals to large organizations. Its release marks a major step forward in AI-assisted productivity and collaboration. -
24
AppFit
AppFit
AppFit offers a comprehensive suite of tools to take your concepts from inception to launch, ensuring your web and mobile application development is a success. With AI support integrated throughout the entire development journey, you can effortlessly build full-stack applications while generating code, designing intuitive interfaces, and troubleshooting challenges more efficiently than ever. Leverage AI-driven market insights and analytics to validate your app ideas, helping you quickly identify the right product-market fit. Gain a deep understanding of your target audience and competitive landscape even before the first line of code is written. Our gamified no-code editor facilitates learning as you create, offering engaging, bite-sized lessons akin to how Duolingo teaches languages. With AppFit, you can seamlessly develop responsive web applications and mobile apps that feel native, all from a single codebase. This approach not only conserves time and resources but also broadens your reach to users across a multitude of devices, enhancing your application's accessibility and impact. Additionally, our platform empowers you to innovate and iterate rapidly, ensuring your app remains relevant in an ever-changing market. -
25
SheetMagic
SheetMagic
$19 per monthSheetMagic is an innovative Google Sheets add-on that integrates unlimited AI content creation and web scraping capabilities directly into your spreadsheets. This powerful tool allows users to generate content and images through simple formulas, utilizing advanced models like GPT-3.5 Turbo, GPT-4/GPT-4 Turbo/GPT-4o, DALL·E 3, and any other LLM via OpenRouter, all without the need for coding or additional markup costs. With SheetMagic, you can efficiently clean, analyze, summarize, and categorize your data; scrape comprehensive information from entire web pages, search engine results, meta titles, headings, and custom selectors; and automate the generation of bulk product descriptions, advertising copy, sales emails, SEO-friendly content, and enriched lead lists based on your existing sheet data and scraped information. This add-on also facilitates programmatic workflows, supports multi-language prompts, and allows for team collaboration with sharing capabilities, audit trails, and real-time dashboards, thereby simplifying repetitive tasks and enabling you to concentrate on strategic initiatives rather than manual data entry. By harnessing the power of AI and automation, SheetMagic significantly enhances productivity and efficiency for users across various industries. -
26
Gemini 2.5 Flash Image
Google
The Gemini 2.5 Flash Image is Google's cutting-edge model for image creation and modification, now available through the Gemini API, build mode in Google AI Studio, and Gemini Enterprise Agent Platform. This model empowers users with remarkable creative flexibility, allowing them to seamlessly merge various input images into one cohesive visual, ensure character or product consistency throughout edits for enhanced storytelling, and execute detailed, natural-language transformations such as object removal, pose adjustments, color changes, and background modifications. Drawing from Gemini’s extensive knowledge of the world, the model can comprehend and reinterpret scenes or diagrams contextually, paving the way for innovative applications like educational tutors and scene-aware editing tools. Showcased through customizable template applications in AI Studio, which includes features such as photo editors, multi-image merging, and interactive tools, this model facilitates swift prototyping and remixing through both prompts and user interfaces. With its advanced capabilities, Gemini 2.5 Flash Image is set to revolutionize the way users approach creative visual projects. -
27
ShipAhead
Tom Han
$99 one timeShipAhead is designed for developers and startups that want to skip the painful setup phase and move straight to building real features. Its comprehensive Nuxt boilerplate includes authentication options like email/password, OAuth, and magic link login, making onboarding effortless. It also integrates Stripe for payments, supports subscriptions, and handles multi-currency billing right out of the box. Security features such as Cloudflare Turnstile captcha and Redis-based rate limiting protect your app from abuse. On the backend, ShipAhead includes a Postgres + Drizzle ORM setup, file storage with Cloudflare R2 or AWS S3, cron jobs for automation, and a super admin dashboard for user and content management. Frontend developers benefit from responsive UI components, pre-built layouts, and PWA support for installable apps. Additional perks like affiliate support, customer chat widgets, pre-written legal pages, and AI-powered templates further reduce overhead. With lifetime access pricing, ShipAhead is a one-time investment that saves developers hundreds of hours while providing continuous updates. -
28
Raptor Write
Raptor Write
FreeRaptor Write is a complimentary writing assistant powered by AI, developed by the Future Fiction Academy, aimed at aiding writers in brainstorming, outlining, and drafting their narratives with ease. Its user-friendly, distraction-minimized design allows authors to concentrate on their creative ideas rather than getting bogged down by complex tools. All work is securely stored within the user’s browser, granting them greater autonomy over their projects. By utilizing OpenRouter, the tool permits users to integrate various AI models and test different writing styles. Although it is straightforward and lightweight, it lacks some of the more advanced structural features available in more robust writing platforms. Nevertheless, it serves as an inviting, cost-free option for writers eager to delve into the integration of AI into their creative processes. With its approachable design and functionalities, it encourages experimentation and innovation among aspiring authors. -
29
GPT-5.1
OpenAI
The latest iteration in the GPT-5 series, known as GPT-5.1, aims to significantly enhance the intelligence and conversational abilities of ChatGPT. This update features two separate model types: GPT-5.1 Instant, recognized as the most popular option, is characterized by a warmer demeanor, improved instruction adherence, and heightened intelligence; on the other hand, GPT-5.1 Thinking has been fine-tuned as an advanced reasoning engine, making it easier to grasp, quicker for simpler tasks, and more diligent when tackling complex issues. Additionally, queries from users are now intelligently directed to the model variant that is best equipped for the specific task at hand. This update not only focuses on boosting raw cognitive capabilities but also on refining the communication style, resulting in models that are more enjoyable to interact with and better aligned with users' intentions. Notably, the system card addendum indicates that GPT-5.1 Instant employs a feature called "adaptive reasoning," allowing it to determine when deeper thought is necessary before formulating a response, while GPT-5.1 Thinking adjusts its reasoning time precisely in relation to the complexity of the question posed. Ultimately, these advancements mark a significant step forward in making AI interactions more intuitive and user-friendly. -
30
Gemini 3 Pro Image
Google
Gemini Image Pro is an advanced multimodal system for generating and editing images, allowing users to craft, modify, and enhance visuals using natural language prompts or by integrating various input images. This platform ensures uniformity in character and object representation throughout edits and offers detailed local modifications, including background blurring, object removal, style transfers, or pose alterations, all while leveraging inherent world knowledge for contextually relevant results. Furthermore, it facilitates the fusion of multiple images into a single, cohesive new visual and prioritizes design workflow elements, featuring template-based outputs, consistency in brand assets, and the ability to maintain recurring character or style appearances across different scenes. Additionally, the system incorporates digital watermarking to identify AI-generated images and is accessible via Gemini API, Google AI Studio, and Gemini Enterprise Agent Platform, making it a versatile tool for creators across various industries. With its robust capabilities, Gemini Image Pro is set to revolutionize the way users interact with image generation and editing technologies. -
31
LFM2
Liquid AI
LFM2 represents an advanced series of on-device foundation models designed to provide a remarkably swift generative-AI experience across a diverse array of devices. By utilizing a novel hybrid architecture, it achieves decoding and pre-filling speeds that are up to twice as fast as those of similar models, while also enhancing training efficiency by as much as three times compared to its predecessor. These models offer a perfect equilibrium of quality, latency, and memory utilization suitable for embedded system deployment, facilitating real-time, on-device AI functionality in smartphones, laptops, vehicles, wearables, and various other platforms, which results in millisecond inference, device durability, and complete data sovereignty. LFM2 is offered in three configurations featuring 0.35 billion, 0.7 billion, and 1.2 billion parameters, showcasing benchmark results that surpass similarly scaled models in areas including knowledge recall, mathematics, multilingual instruction adherence, and conversational dialogue assessments. With these capabilities, LFM2 not only enhances user experience but also sets a new standard for on-device AI performance. -
32
GPT-5.2 Thinking
OpenAI
The GPT-5.2 Thinking variant represents the pinnacle of capability within OpenAI's GPT-5.2 model series, designed specifically for in-depth reasoning and the execution of intricate tasks across various professional domains and extended contexts. Enhancements made to the core GPT-5.2 architecture focus on improving grounding, stability, and reasoning quality, allowing this version to dedicate additional computational resources and analytical effort to produce responses that are not only accurate but also well-structured and contextually enriched, especially in the face of complex workflows and multi-step analyses. Excelling in areas that demand continuous logical consistency, GPT-5.2 Thinking is particularly adept at detailed research synthesis, advanced coding and debugging, complex data interpretation, strategic planning, and high-level technical writing, showcasing a significant advantage over its simpler counterparts in assessments that evaluate professional expertise and deep understanding. This advanced model is an essential tool for professionals seeking to tackle sophisticated challenges with precision and expertise. -
33
GPT-5.2 Instant
OpenAI
The GPT-5.2 Instant model represents a swift and efficient iteration within OpenAI's GPT-5.2 lineup, tailored for routine tasks and learning, showcasing notable advancements in responding to information-seeking inquiries, how-to guidance, technical documentation, and translation tasks compared to earlier models. This version builds upon the more engaging conversational style introduced in GPT-5.1 Instant, offering enhanced clarity in its explanations that prioritize essential details, thus facilitating quicker access to precise answers for users. With its enhanced speed and responsiveness, GPT-5.2 Instant is adept at performing common functions such as handling inquiries, creating summaries, supporting research efforts, and aiding in writing and editing tasks, while also integrating extensive enhancements from the broader GPT-5.2 series that improve reasoning abilities, manage longer contexts, and ensure factual accuracy. As a part of the GPT-5.2 family, it benefits from shared foundational improvements that elevate its overall reliability and performance for a diverse array of daily activities. Users can expect a more intuitive interaction experience and a significant reduction in the time spent searching for information. -
34
GPT-5.2 Pro
OpenAI
The Pro version of OpenAI’s latest GPT-5.2 model family, known as GPT-5.2 Pro, stands out as the most advanced offering, designed to provide exceptional reasoning capabilities, tackle intricate tasks, and achieve heightened accuracy suitable for high-level knowledge work, innovative problem-solving, and enterprise applications. Building upon the enhancements of the standard GPT-5.2, it features improved general intelligence, enhanced understanding of longer contexts, more reliable factual grounding, and refined tool usage, leveraging greater computational power and deeper processing to deliver thoughtful, dependable, and contextually rich responses tailored for users with complex, multi-step needs. GPT-5.2 Pro excels in managing demanding workflows, including sophisticated coding and debugging, comprehensive data analysis, synthesis of research, thorough document interpretation, and intricate project planning, all while ensuring greater accuracy and reduced error rates compared to its less robust counterparts. This makes it an invaluable tool for professionals seeking to optimize their productivity and tackle substantial challenges with confidence. -
35
SpawnHQ
SpawnHQ
$59 per monthSpawnHQ is a SaaS platform that enables users to quickly deploy, configure, and manage autonomous AI agents within minutes, eliminating the need for coding or infrastructure setup. By providing a marketplace filled with pre-built, skill-based agents tailored to your brand's context, these agents operate continuously on managed computing resources and seamlessly integrate with various tools such as Discord, web chat widgets, Twitter, SEO services, and customer relationship management systems. Users can select specific skills, including a support bot for addressing customer inquiries, an SEO agent for tracking rankings and creating content, an outbound agent for lead generation and outreach, or social and content engines, and then set up the necessary integrations along with their brand context. Once configured, these agents can respond to natural language commands and function autonomously, managing tasks like research, CRM updates, content creation, and automated replies around the clock. The platform takes care of managed compute, AI model routing (including Claude, GPT, and Gemini), scheduling, logging, reporting, and implementing guardrails, which empowers the agents to think and act with a degree of independence. This capability allows businesses to streamline their operations and enhance efficiency without requiring extensive technical knowledge. -
36
Nano Banana
Google
Nano Banana offers a streamlined, user-friendly way to generate and edit images using Gemini’s “Fast” model. It focuses on fun, casual transformations, making it great for remixing selfies, trying new styles, or merging multiple pictures into a single creation. The model handles character consistency well, ensuring that people look like themselves even when placed in new settings or artistic interpretations. Users can easily perform spot edits like changing backgrounds, adjusting small details, or adding creative elements without needing advanced controls. Nano Banana also excels at playful results such as figurine effects, retro photo booth aesthetics, or themed portraits. These quick edits allow anyone to explore creative concepts in seconds. It’s built for low-effort, high-fun experimentation, making it perfect for social media content or personal projects. Nano Banana provides an approachable entry point for image generation without the depth or complexity of Pro-level features. -
37
ChatKit
OpenAI
ChatKit is a versatile toolkit designed for developers to seamlessly integrate and manage chat agents on various applications and websites. It offers a range of functionalities, including the ability to converse over external documents, text-to-speech features, customizable prompt templates, and quick-access shortcut triggers. Users have the option to operate ChatKit with their personal OpenAI API key, which incurs costs based on OpenAI’s token pricing, or they can utilize ChatKit's credit system, necessitating a license. The platform accommodates a variety of model backends, such as OpenAI, Azure OpenAI, Google Gemini, and Ollama, as well as different routing frameworks like OpenRouter. Additionally, ChatKit boasts features like cloud synchronization, team collaboration tools, web accessibility, launcher widgets, shortcuts, and organized conversation flows over documents, enhancing its usability. Ultimately, ChatKit streamlines the process of deploying sophisticated chat agents, allowing developers to focus on functionality without the burden of constructing an entire chat infrastructure from the ground up. With its extensive capabilities, it empowers teams to create more engaging user interactions effortlessly. -
38
GPT-5.1 Instant
OpenAI
GPT-5.1 Instant is an advanced AI model tailored for everyday users, merging rapid response times with enhanced conversational warmth. Its adaptive reasoning capability allows it to determine the necessary computational effort for tasks, ensuring swift responses while maintaining a deep level of understanding. By focusing on improved instruction adherence, users can provide detailed guidance and anticipate reliable execution. Additionally, the model features expanded personality controls, allowing the chat tone to be adjusted to Default, Friendly, Professional, Candid, Quirky, or Efficient, alongside ongoing trials of more nuanced voice modulation. The primary aim is to create interactions that feel more organic and less mechanical, all while ensuring robust intelligence in writing, coding, analysis, and reasoning tasks. Furthermore, GPT-5.1 Instant intelligently manages user requests through the main interface, deciding whether to employ this version or the more complex “Thinking” model based on the context of the query. Ultimately, this innovative approach enhances user experience by making interactions more engaging and tailored to individual preferences. -
39
GPT-5.1 Thinking
OpenAI
GPT-5.1 Thinking represents an evolved reasoning model within the GPT-5.1 lineup, engineered to optimize "thinking time" allocation according to the complexity of prompts, allowing for quicker responses to straightforward inquiries while dedicating more resources to tackle challenging issues. In comparison to its earlier version, it demonstrates approximately double the speed on simpler tasks and takes twice as long for more complex ones. The model emphasizes clarity in its responses, minimizing the use of jargon and undefined terminology, which enhances the accessibility and comprehensibility of intricate analytical tasks. It adeptly modifies its reasoning depth, ensuring a more effective equilibrium between rapidity and thoroughness, especially when addressing technical subjects or multi-step inquiries. By fusing substantial reasoning power with enhanced clarity, GPT-5.1 Thinking emerges as an invaluable asset for handling complicated assignments, including in-depth analysis, programming, research, or technical discussions, while simultaneously decreasing unnecessary delays for routine requests. This improved efficiency not only benefits users seeking quick answers but also supports those engaged in more demanding cognitive tasks. -
40
GPT-5.2
OpenAI
GPT-5.2 marks a new milestone in the evolution of the GPT-5 series, bringing heightened intelligence, richer context understanding, and smoother conversational behavior. The updated architecture introduces multiple enhanced variants that work together to produce clearer reasoning and more accurate interpretations of user needs. GPT-5.2 Instant remains the main model for everyday interactions, now upgraded with faster response times, stronger instruction adherence, and more reliable contextual continuity. For users tackling complex or layered tasks, GPT-5.2 Thinking provides deeper cognitive structure, offering step-by-step explanations, stronger logical flow, and improved endurance across long-form reasoning challenges. The platform automatically determines which model variant is optimal for any query, ensuring users always benefit from the most appropriate capabilities. These advancements reduce friction, simplify workflows, and produce answers that feel more grounded and intention-aware. In addition to intelligence upgrades, GPT-5.2 emphasizes conversational naturalness, making exchanges feel more intuitive and humanlike. Overall, this release delivers a more capable, responsive, and adaptive AI experience across all forms of interaction. -
41
Grok 4.1 Thinking is the reasoning-enabled version of Grok designed to handle complex, high-stakes prompts with deliberate analysis. Unlike fast-response models, it visibly works through problems using structured reasoning before producing an answer. This approach improves accuracy, reduces misinterpretation, and strengthens logical consistency across longer conversations. Grok 4.1 Thinking leads public benchmarks in general capability and human preference testing. It delivers advanced performance in emotional intelligence by understanding context, tone, and interpersonal nuance. The model is especially effective for tasks that require judgment, explanation, or synthesis of multiple ideas. Its reasoning depth makes it well-suited for analytical writing, strategy discussions, and technical problem-solving. Grok 4.1 Thinking also demonstrates strong creative reasoning without sacrificing coherence. The model maintains alignment and reliability even in ambiguous scenarios. Overall, it sets a new standard for transparent and thoughtful AI reasoning.
-
42
Nano Banana 2
Google
Nano Banana 2 is the newest evolution of Google’s image generation technology, merging the intelligence of Nano Banana Pro with the rapid performance of Gemini Flash. Designed for both speed and quality, it enables users to generate high-fidelity visuals with advanced reasoning capabilities. The model leverages Gemini’s world knowledge and real-time web grounding to render accurate subjects and informative visuals. It improves text rendering accuracy, allowing users to create legible designs and even translate text directly within images. Enhanced instruction adherence ensures the final output closely matches detailed and nuanced prompts. Nano Banana 2 supports consistent character and object representation across complex workflows, making it ideal for storytelling and creative production. It also provides flexible output formats, from 512px images to full 4K resolution. Visual fidelity upgrades bring sharper textures, richer lighting, and more vibrant detail. Integrated across products like the Gemini app, Search, AI Studio, Google Cloud Vertex AI, and Ads, it fits seamlessly into various workflows. By closing the gap between speed and quality, Nano Banana 2 delivers professional-grade image generation at Flash-level performance. -
43
Fluent
Epic Bits
$49Fluent is a macOS-native AI writing and productivity assistant built to eliminate constant app switching. It injects AI directly into any application, using live context to deliver more relevant and accurate responses. Users can write with the right tone, chat with documents, and compare outputs without losing formatting. Fluent supports more than 500 AI models, giving users the freedom to bring their own API keys or run local models for maximum privacy. The Smart Panel works instantly across apps like browsers, email, notes, messaging, and productivity tools. Customizable shortcuts and actions allow users to tailor Fluent to their workflows. Memory and context awareness enable smarter, more consistent results over time. MCP support and dynamic prompt variables unlock advanced automation use cases. Fluent runs fast on both Apple Silicon and Intel Macs. With a one-time purchase and lifetime upgrades, Fluent is built for long-term productivity. -
44
nanobot
nanobot
Nanobot is a lightweight, open-source framework for personal AI assistants that focuses on providing essential agent functionalities and autonomous capabilities within a compact and understandable codebase of roughly 3,400 to 4,000 lines of Python, which is around 99% smaller than similar large agent frameworks. Its design is purposely straightforward and modular, making it accessible for researchers and developers to comprehend, modify, and explore for various projects. The framework includes features such as persistent memory, task scheduling, built-in tools, and the ability to integrate with several large language models through platforms like OpenRouter, allowing it to function locally or to be deployed swiftly using command-line instructions. Furthermore, nanobot supports real-time web searches and can connect through multiple chat platforms, including Telegram, Discord, WhatsApp, and Feishu, enabling seamless interaction across diverse environments. The lightweight structure not only facilitates rapid startup times and minimal resource consumption but also provides a clean architectural framework that developers can easily customize without intricate abstractions, making it an ideal choice for both personal use and experimentation in AI development. Additionally, its user-friendly nature encourages innovation and creativity among developers, fostering an environment ripe for advancements in AI applications. -
45
Gemini 3.1 Flash Image
Google
Gemini 3.1 Flash Image is Google’s next-generation image generation model that merges high-speed performance with advanced visual intelligence. Built to deliver both quality and efficiency, it enables rapid creation of photorealistic and data-driven visuals. The model leverages Gemini’s deep world knowledge and real-time web grounding to produce more contextually accurate results. It enhances text rendering within images, supporting clean typography and seamless multilingual translation. Improved instruction adherence ensures that detailed and nuanced prompts are followed precisely. Gemini 3.1 Flash Image also supports consistent character and object representation across complex scenes, making it ideal for storytelling and branded content. Flexible production specifications allow outputs from 512px to full 4K resolution. Visual upgrades deliver richer lighting, sharper details, and improved texture quality. Integrated across platforms such as the Gemini app, Search AI Mode, AI Studio, and Vertex AI, it fits into diverse workflows. By combining speed, precision, and creative control, Gemini 3.1 Flash Image sets a new benchmark for scalable image generation. -
46
GPT-5.3 Instant
OpenAI
GPT-5.3 Instant represents a significant refinement of ChatGPT’s core conversational model, prioritizing smoother, more natural interactions. This update directly addresses user feedback about tone, unnecessary refusals, and overly defensive disclaimers. The model now provides more direct answers when safe to do so, minimizing conversational friction and reducing dead ends. It also demonstrates improved judgment when handling sensitive topics, offering balanced responses without moralizing preambles. When using web information, GPT-5.3 Instant better synthesizes search results with its internal knowledge, delivering concise and relevant insights instead of link-heavy summaries. Internal evaluations show meaningful reductions in hallucination rates, particularly in high-stakes domains such as medicine, law, and finance. The model is designed to feel consistent and familiar while offering noticeable capability upgrades. Writing performance has been enhanced, enabling richer storytelling and more expressive prose without sacrificing clarity. These improvements aim to make ChatGPT feel less mechanical and more intuitively helpful in everyday use. GPT-5.3 Instant is available across ChatGPT and through the API, with older versions remaining temporarily accessible before retirement. -
47
GPT-5.4 Pro
OpenAI
GPT-5.4 Pro is a high-performance AI model introduced by OpenAI for users who require maximum capability when solving complex problems. It builds on earlier GPT models by integrating advanced reasoning, coding, and workflow automation into a single system. The model is designed to assist professionals with demanding tasks such as data analysis, financial modeling, document generation, and software development. GPT-5.4 Pro can interact directly with computers and applications, allowing AI agents to perform multi-step workflows across different tools and environments. Its extended context window supports up to one million tokens, enabling it to analyze large amounts of information while maintaining accuracy. The model also improves deep web research and long-form reasoning tasks. Developers benefit from improved tool usage and search capabilities that help agents select and operate external tools efficiently. GPT-5.4 Pro delivers stronger coding performance and faster iteration cycles for developers working on complex software projects. It also reduces token usage compared with earlier models, improving cost efficiency and speed. Overall, GPT-5.4 Pro is designed to support advanced professional workflows and AI-powered automation at scale. -
48
GPT‑5.4 Thinking
OpenAI
GPT-5.4 Thinking is a specialized version of OpenAI’s GPT-5.4 model designed to deliver enhanced reasoning and structured problem-solving in ChatGPT. It integrates improvements in coding, professional knowledge work, and agent-based workflows into a single AI system. One of its key features is the ability to present a plan for its reasoning before generating a final answer. This allows users to review the direction of the response and make adjustments while the model is still working. By enabling this interactive process, GPT-5.4 Thinking helps produce more precise and relevant results. The model is particularly effective for tasks that require deep research or multi-step reasoning. It also maintains context across longer prompts and conversations, reducing confusion in complex discussions. GPT-5.4 Thinking improves how AI interacts with tools and software environments during problem-solving workflows. Its advanced reasoning capabilities allow it to handle analytical tasks with higher consistency and clarity. As a result, GPT-5.4 Thinking is designed to support professionals who need reliable AI assistance for complex work. -
49
GPT-5.4 mini
OpenAI
GPT-5.4 mini is an advanced AI model designed to provide a balance between high performance, speed, and cost efficiency. It is built to handle a wide range of tasks, including coding, reasoning, tool usage, and multimodal understanding. Compared to earlier versions, GPT-5.4 mini delivers significantly improved performance while operating at faster speeds. The model is particularly effective in environments where low latency is essential, such as real-time coding assistants and interactive applications. It supports capabilities like function calling, tool integration, and image-based reasoning, making it highly versatile. GPT-5.4 mini is also well-suited for subagent architectures, where it can efficiently process smaller tasks within larger AI systems. Developers can use it to automate workflows, analyze data, and build responsive AI-driven applications. Its strong performance across benchmarks shows that it approaches the capabilities of larger models in many scenarios. At the same time, it maintains a lower cost, making it ideal for high-volume usage. Overall, GPT-5.4 mini provides a powerful and scalable solution for modern AI development. -
50
GPT-5.4 nano
OpenAI
GPT-5.4 nano is a compact and cost-efficient AI model designed for handling lightweight, high-frequency tasks at scale. It is optimized for operations such as classification, data extraction, ranking, and simple coding assistance. The model delivers fast response times, making it suitable for applications where low latency is critical. Compared to earlier nano models, GPT-5.4 nano offers improved performance while maintaining minimal computational cost. It supports key features such as tool usage and structured output generation, allowing it to integrate easily into automated systems. The model is often used as a subagent within larger AI workflows, handling repetitive or supporting tasks efficiently. This approach allows more complex models to focus on higher-level reasoning and decision-making. GPT-5.4 nano is particularly useful in environments that require processing large volumes of requests quickly. Its efficiency makes it ideal for cost-sensitive applications and scalable deployments. Overall, it provides a reliable and fast solution for simple AI-driven tasks.