{"id":"chutes","title":"Chutes","content":"**Chutes** is a serverless compute platform designed for deploying, scaling, and running open-source artificial intelligence (AI) models. Developed by Rayon Labs, it operates on a decentralized, open-source infrastructure to provide AI inference and other computational services for developers and enterprises. [\\[1\\]](#cite-id-VMRraSdvPv) [\\[2\\]](#cite-id-VWVRYqOVuf)\n\n## Overview\n\nChutes offers a serverless platform for developers to run open-source AI models without managing the underlying infrastructure. Its decentralized architecture is designed for scalability and processes trillions of tokens monthly. The platform simplifies access to high-performance AI computation by keeping popular models \"permanently hot\" for immediate, low-latency inference. It also provides the flexibility for users to deploy their own custom models. [\\[1\\]](#cite-id-VMRraSdvPv)\n\n## Technology and Infrastructure\n\nChutes operates as a serverless compute layer for AI tasks. This model abstracts away server management, allowing developers to focus on their application code. The platform is responsible for allocating resources, scaling, and managing the execution environment for each job. [\\[1\\]](#cite-id-VMRraSdvPv)\n\nThe infrastructure is described as decentralized and open-source. This suggests a distributed network of compute resources rather than a centralized data center model. This design can contribute to resilience and potentially lower operational costs. The platform is engineered to handle various AI workloads, with a primary focus on model inference. [\\[1\\]](#cite-id-VMRraSdvPv)\n\nA key technical feature is the \"permanently hot\" model system. By keeping frequently used AI models loaded and active, Chutes aims to minimize the cold-start delays often associated with serverless functions, making it suitable for real-time applications. The platform's team monitors for new open-source model releases and works to integrate them quickly, often making them available on the platform within a short time after their public release. [\\[1\\]](#cite-id-VMRraSdvPv)\n\n## Services and Features\n\nChutes offers a range of services centered around AI model execution and plans to expand its capabilities. [\\[1\\]](#cite-id-VMRraSdvPv)\n\n### AI Model Inference\n\nThe primary service is high-performance inference for a variety of AI models. Users can access these models via an API to integrate AI capabilities into their own applications. The platform provides analytics for monitoring usage and performance. [\\[1\\]](#cite-id-VMRraSdvPv)\n\n### Model Support\n\nChutes supports a diverse set of AI model types, allowing for a wide array of applications. The platform categorizes its supported models into several groups:\n\n* **Large Language Models (LLMs):** For tasks involving text generation, summarization, and conversation.\n* **Image Generation:** For creating images from text prompts (diffusion models).\n* **Video, Speech, and Music:** For processing and generating multimedia content.\n* **Embedding Models:** To convert data into numerical representations for search, recommendation, and classification tasks.\n* **Content Moderation:** For detecting hate speech, NSFW content, and other undesirable material.\n* **3D Generation:** For creating 3D models and animations.\n* **Custom Models:** Users can deploy their own open models on the platform.\n\nThis range of support indicates the platform's goal to be a comprehensive resource for various AI development needs. [\\[1\\]](#cite-id-VMRraSdvPv)\n\n### Planned Services\n\nChutes has announced several upcoming features to broaden its service offerings:\n\n* **Long Jobs:** This feature is intended for long-running, asynchronous tasks such as batch processing, data analysis, and model training.\n* **TEE/Secure Compute:** A planned service that will use Trusted Execution Environments (TEEs) to provide secure, private, and isolated compute environments for sensitive data and proprietary models.\n* **Consumer Applications:** The company plans to release its own consumer-facing apps, named Chutes Chat and Chutes Studio.\n\nThese planned additions suggest a strategy to serve a wider range of computational needs, from individual developers to enterprise clients with security-sensitive workloads. [\\[1\\]](#cite-id-VMRraSdvPv)\n\n## Available Models and Integrations\n\nChutes provides access to numerous open-source models from various research labs and companies. The platform highlights its ability to quickly host new and popular models. Some of the model providers featured on the platform include DeepSeek, Mistral AI, Microsoft, Google (Gemma), Qwen (Alibaba), and Moonshot AI (Kimi). Specific models available include variants of DeepSeek V3, Mistral Small, and NousResearch's DeepHermes. [\\[1\\]](#cite-id-VMRraSdvPv) [\\[2\\]](#cite-id-VWVRYqOVuf)\n\nThe company's website lists several projects and companies that use or integrate with its services, including:\n\n* Bittensor\n* OpenRouter\n* Cline\n* Kilo\n* Roo Code\n* Fetch.ai\n* DeepFakeAI\n\nThese integrations demonstrate the platform's adoption within the AI and decentralized technology ecosystems. [\\[1\\]](#cite-id-VMRraSdvPv)\n\n## Development\n\nChutes is developed and operated by Rayon Labs. The platform's official X (formerly Twitter) account was created in November 2024, and it actively posts updates regarding new features, model availability, and pricing changes. [\\[1\\]](#cite-id-VMRraSdvPv) [\\[2\\]](#cite-id-VWVRYqOVuf)\n\n## Pricing Model\n\nChutes utilizes a subscription-based pricing model with several tiers, supplemented by a pay-as-you-go (PAYG) option for usage that exceeds plan limits. The pricing structure was updated in August 2025 to provide fixed monthly plans. [\\[2\\]](#cite-id-VWVRYqOVuf)\n\nThe available subscription tiers are:\n\n* **Base:** A low-cost plan designed for individuals or small projects, offering up to 300 requests per day.\n* **Plus:** A mid-tier plan that includes up to 2,000 requests per day and standard email support.\n* **Pro:** A higher-tier plan providing up to 5,000 requests per day and priority support.\n* **Enterprise:** A custom plan with unlimited requests, dedicated support, Service [Level](https://iq.wiki/wiki/level) Agreement (SLA) guarantees, and custom billing options.\n\nAll paid tiers include unlimited API keys, access to all available models, and use of the Chutes Chat and Chutes Studio applications. The platform also offers a program for startups, providing up to $20,000 in free credits to eligible companies. [\\[1\\]](#cite-id-VMRraSdvPv)","summary":"Chutes is an open-source, decentralized serverless compute platform for AI, developed by Rayon Labs. It specializes in deploying, scaling, and running open-source models, offering high-performance AI inference for developers and enterprise applications.","images":[{"id":"QmR8mzLB2uPtNonjCqzZnZtgZd1WZa8xpMrGgxh1Xpd237","type":"image/jpeg, image/png"}],"categories":[{"id":"organizations","title":"organizations"}],"tags":[{"id":"AI"},{"id":"Protocols"},{"id":"Organizations"},{"id":"Developers"},{"id":"Blockchains"}],"media":[{"id":"QmNyB6VcPhMFJw3VgkVsSyJ5dbgM7QtGHAwcuuuHRH3HQe","type":"GALLERY","source":"IPFS_IMG"},{"id":"QmaPKAjX5hGAf2pEPPpgKChEFEBzmRHYwu9r5BuxSt8wa7","type":"GALLERY","source":"IPFS_IMG"},{"id":"QmQNCGhkBbP4pXyUz1aByWvc5rRaGYrmRxxhbbiNc5bGQ6","type":"GALLERY","source":"IPFS_IMG"},{"id":"QmYgiofnf1CasYre28x3YVFePUkoE4kbYytaaATzVFZeHc","type":"GALLERY","source":"IPFS_IMG"},{"id":"QmfG1Pv5PEFgjTakxXN9jFK7piNQKpVdWSfp92vpQA3xtg","type":"GALLERY","source":"IPFS_IMG"}],"metadata":[{"id":"references","value":"[{\"id\":\"VMRraSdvPv\",\"url\":\"https://chutes.ai/\",\"description\":\"Chutes official website\",\"timestamp\":1754581524951},{\"id\":\"VWVRYqOVuf\",\"url\":\"https://x.com/chutes\\\\_ai\",\"description\":\"Chutes official X (Twitter) profile\",\"timestamp\":1754581524951}]"},{"id":"previous_cid","value":"\"https://ipfs.everipedia.org/ipfs/Qmabz9V32ByD6VQEK93i6xLMt7hyBHh4GTbF4MqjLckXrB\""},{"id":"commit-message","value":"\"Republished wiki with updated Overview section\""},{"id":"previous_cid","value":"Qmabz9V32ByD6VQEK93i6xLMt7hyBHh4GTbF4MqjLckXrB"}],"events":[{"id":"2784b5da-096a-4332-aab3-273e2dfd6859","date":"2024-11","title":"Chutes Founded","type":"CREATED","description":"Chutes was founded by Rayon Labs as a decentralized, distributed serverless AI compute platform. The official X (Twitter) account was created in November 2024.","multiDateStart":null,"multiDateEnd":null},{"id":"cc9e0a76-4aa4-4a8f-9696-1365ca704c03","date":"2025-08","title":"New Pricing Tiers Launched","type":"DEFAULT","description":"Chutes announced and launched new monthly subscription tiers, starting from $3 per month, offering up to 5,000 requests per day for developers and enterprise users.","multiDateStart":null,"multiDateEnd":null},{"id":"70c7c732-0daa-48a0-babe-57a5b0b7f11c","date":"2025-08","title":"GPT OSS 120B Model Goes Live","type":"DEFAULT","description":"Chutes made the GPT OSS 120B model available on its platform, offering it at competitive prices for token input and output.","multiDateStart":null,"multiDateEnd":null}],"user":{"id":"0x8af7a19a26d8fbc48defb35aefb15ec8c407f889"},"author":{"id":"0x8af7a19a26d8fbc48defb35aefb15ec8c407f889"},"language":"en","version":1,"linkedWikis":{"blockchains":[],"founders":[],"speakers":[]}}