Inferless
Inferless provides a serverless GPU platform that enables developers to deploy and scale machine learning models instantly with zero infrastructure management.
At a Glance
- AI Startups
- ML Engineers
- Enterprise AI Teams
- Developers building with open-source models
AI Tools by Inferless
Inferless
Serverless GPU ML Deployment
Latest News
Announcing the acquihire of Inferless by Baseten
Inferless's serverless inference solution places #1 on Product Hunt
Inferless raises Seed funding from Peak XV Partners and Blume Ventures
Launched 'Breakfast with Inferless' tech meetup series
Products & Services
A platform for deploying and scaling machine learning models on serverless GPUs with automatic resource management and low latency.
Market Position
Positions itself against AWS and Azure by offering a superior developer experience, lower cold-start times, and significantly reduced costs through true serverless efficiency.
Leadership
Founders
Aishwarya Goel
Co-founder and CEO. Previously founded Peakperformer (scaled to $1M ARR). Early team member at PhonePe and Trupay. Started first venture at age 19.
Nilesh Agarwal
Co-founder and CTO. Previously co-founder at Peakperformer. Experience in solving complex technical problems and building scalable infrastructure.
Executive Team
Aishwarya Goel
CEO & Co-founder
Founder of Peakperformer, early at PhonePe.
Nilesh Agarwal
CTO & Co-founder
Technical lead and co-founder of Peakperformer.
Founding Story
The founders pivoted from their previous startup, Peakperformer (an AI coaching app), which had reached $900k ARR. They realized the infrastructure for serving custom ML models was inefficient and expensive, leading them to build Inferless to solve these infrastructure challenges.
Business Model
Revenue Model
Serverless pay-per-second billing based on GPU usage time.
Pricing Tiers
- High-performance dedicated A100 GPU for demanding workloads.
- Dedicated A10 GPU for mid-range production workloads.
- Cost-effective shared A10 GPU resources.
- Entry-level shared T4 GPU for lighter inference tasks.
- $30 in free credits for new users.
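The pay-per-second model means cost scales with actual GPU runtime rather than provisioned capacity. A minimal sketch of that arithmetic, using hypothetical per-second rates (placeholders for illustration, not Inferless's published pricing):

```python
# Hedged sketch: estimating serverless GPU inference cost under
# pay-per-second billing. The per-second rates below are hypothetical
# placeholders, not Inferless's published prices.

HYPOTHETICAL_RATES_PER_SECOND = {
    "A100 (dedicated)": 0.00100,
    "A10 (dedicated)": 0.00050,
    "A10 (shared)": 0.00030,
    "T4 (shared)": 0.00015,
}

def estimate_monthly_cost(gpu: str, seconds_per_request: float,
                          requests_per_month: int) -> float:
    """Bill only for the seconds the GPU actually runs (no idle charge)."""
    rate = HYPOTHETICAL_RATES_PER_SECOND[gpu]
    return rate * seconds_per_request * requests_per_month

# Example: 2-second inferences, 100,000 requests/month on a shared T4.
cost = estimate_monthly_cost("T4 (shared)", 2.0, 100_000)
print(f"${cost:,.2f}")  # 0.00015 * 2 * 100,000 = $30.00
```

The key contrast with always-on cloud GPU instances is the absence of an idle-time term: a dedicated instance bills for every provisioned second regardless of traffic.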
Target Markets
- AI Startups
- ML Engineers
- Enterprise AI Teams
- Developers building with open-source models
Use Cases
- Cost-efficient LLM deployment
- Low-latency real-time inference
- Processing large-scale embeddings
- Scaling custom open-source models
Customers
- Cleanlab
- Spoofsense
- Myreader.ai