Category · AI agent infrastructure
Agent infrastructure: vector DBs, inference, observability, and search.
agent_infra is what builders touch when they're wiring up RAG, an agent loop, or an LLM application. Magpie covers the vector DB layer (Pinecone), inference (Together AI, Groq, Hugging Face), observability (Helicone, Langfuse), web search APIs (Tavily), and self-hostable model runners (Ollama, Replicate). Picks favour OSS-first or generous free-tier pricing over enterprise contracts.
16 tools in this category. Curated and scored by Magpie's six-dimension rubric.
PyTorch
Open-source machine-learning framework
Free
Ollama
The easiest way to run open language models locally
Free·Solopreneur → MVP
Pinecone
Reference vector database for RAG and semantic search — Starter tier is free up to 2GB
Free·MVP → Growth
Hugging Face
The model hub the open-source AI ecosystem runs on — free Spaces, $9 PRO, $20/user Team
Free·Solopreneur → Growth
Replicate
Run, fine-tune, and deploy AI models with one line of code
Free·MVP → Growth
Fireworks AI
Fast, low-cost inference for open-source models
Free
Groq
Sub-second LPU inference — Llama 3.1 8B at 840 tokens/sec for $0.05/M input
Free·Solopreneur → Seed
LangChain
Open-source framework for building LLM applications
Free
Modal
Serverless GPU compute for AI builders
Free
Llama
Meta's open-weights family of foundation models
Free
Helicone
Open-source LLM observability — 10K free requests, OpenAI/Anthropic/Together drop-in proxy
Free·Solopreneur → Seed
Langfuse
Open-source LLM observability and evals — Hobby tier free, $29/mo Core, self-hostable
Free·MVP → Growth
Tavily
Web search API designed for AI agents — 1,000 free credits/mo, $0.008/credit PAYG
Free·Solopreneur → MVP
Together AI
Cheap, fast inference for open models — Llama 3.3 70B at $0.88 per million tokens
n/a·MVP → Growth
Gemma
Google's open-weights model family
Free
Qwen
Alibaba's open-weights language model family
Free
Build your own stack
Want a stack tuned to your stage and function?
Tell Magpie what you do and we'll match tools across build, comms, productivity and your industry.
Build my stack