PinnedFireworks.aiFireworks.ai: Fast, Affordable, Customizable Gen AI Platformtl;dr Fireworks.ai releases the fast, affordable, and customizable Fireworks GenAI Platform. It enables product developers to run…7 min read·Aug 17, 2023----
Fireworks.aiFireAttention — Serving Open Source Models 4x faster than vLLM by quantizing with ~no tradeoffsServing Open Source Models 4x faster than vLLM by quantizing with ~no tradeoffs6 min read·Jan 8, 2024--1--1
Fireworks.aiFireworks Raises the Quality Bar with Function Calling Model and API ReleaseFireworks conducts alpha launch of our function calling model and API, with quality reaching GPT-4 and surpassing open-source models9 min read·Dec 20, 2023--1--1
Fireworks.aiMixtral 8x7B on Fireworks: faster, cheaper, even before the official releaseThe newest Mistral AI MoE model, Mixtral 8x7B, is available on the Fireworks platform in both base and instruction-tuned variants. We offer…4 min read·Dec 14, 2023----
Fireworks.aiLLM Inference Performance Benchmarking (Part 1)Optimizing Large Language Model (LLM) machine performance in inference is a complex space and no solution is one-size-fits-all. Use cases…3 min read·Nov 3, 2023--1--1
Fireworks.aiNew in Fireworks: Image-to-Image and ControlNet support for SSD-1B and SDXL!The Fireworks.ai blazing-fast inference platform enables developers to build with generative AI to accelerate product innovation.5 min read·Nov 2, 2023----
Fireworks.aiFireworks.ai Achieves SOC 2 Type II and HIPAA ComplianceWe are pleased to report that the Fireworks.ai inference platform is both SOC 2 Type II and HIPAA compliant. These important milestones…2 min read·Oct 27, 2023----
Fireworks.aiAccelerating Code Completion with Fireworks Fast LLM InferenceAt Fireworks.ai, we provide the world’s fastest LLM inference platform which enables developers to run, fine-tune, deploy, and share large…3 min read·Oct 11, 2023----
Fireworks.aiFireworks.ai Now Available on LangChain Prompt PlaygroundWith the popularity of large language models (LLMs) such as ChatGPT and Llama 2, there are now a multitude of models to choose from —…5 min read·Oct 2, 2023----
Fireworks.aiSimplifying Code Infilling with Code Llama and Fireworks.aiLlama 2 and Code Llama3 min read·Sep 12, 2023----