Discuss AI cost optimization, share architecture patterns, and connect with developers building with LLMs.
Hey everyone, Just thought I'd drop in to share a cool optimization I recently implemented while working with the llama.cpp model. If you've been using the Llama architecture, you…
Hey folks, I wanted to share an interesting development in AI accessibility. Recently, the Maltese government announced a partnership with OpenAI to provide ChatGPT Plus to their e…
I've been working on a project that requires processing a hefty volume of legal documents, and I recently made the switch to using the Falcon-40B model to streamline my workflow. P…
I recently stumbled upon an intriguing case while exploring OpenReview that made me rethink the integrity of certain AI-themed academic initiatives marketed to high school students…
Hey folks! I recently came across an intriguing case study about how smaller countries are adopting large language models for national benefit. Specifically, I read about the partn…
Hey folks, I've been leveraging several LLMs across different providers (OpenAI, Azure, and AWS) for a few projects and tracking the costs has been a bit of a nightmare. I've bee…
Hey fellow AI enthusiasts! Are you working on a cool machine learning project, or maybe launching a new AI startup? This is the spot to share what you're up to and maybe even find…
Hey folks, I've been working on a RAG (Retrieval-Augmented Generation) pipeline for a client project, and as expected, the costs are adding up. We're primarily using OpenAI's Ada…
Greetings everyone! It's that time of the month where we connect talent with opportunity in the AI and machine learning realm. For those Hiring, kindly follow this format: ### Po…
Hey everyone! Launching a new thread to help developers and companies in the AI/LLM space find each other more easily. **For Companies Hiring:** - **Location:** e.g., New York - *…
Hello everyone, welcome to our AI/LLM monthly job connection thread! If you're an employer looking to add AI/LLM expertise to your team, please use the instruction below: - **Posi…
Hey folks, I'm at the crossroads of choosing between OpenAI and Anthropic for powering our customer support bot with LLMs. I've been crunching the numbers, and while both have thei…
Hey folks, I've been working with Claude API over the last few months and the costs have started to pile up quickly. We started noticing this especially when scaling up our usage f…
Recently, while navigating the web of academic papers on platforms like OpenReview, I stumbled across an intriguing figure—an individual named Alex Chen, whose profile mentions an…
Just wanted to share my recent experience incorporating OpenAI's Codex model into a mobile application. I've been working on a chatbot integration for a client who wanted to bring…
Hey everyone, I've been working on a Retrieval-augmented Generation (RAG) pipeline and wanted to break down the costs and check if others are experiencing the same. Here's the s…
Hey everyone! I've been exploring cost-effective ways to work with large language models, especially here in China. After some experimentation and a bit of research, I've found an…
Hey folks, I've been evaluating OpenAI and Anthropic as potential providers for our company's upcoming LLM-based project. We're primarily looking at GPT-4 from OpenAI and Claude f…
Hi all, I've recently completed a hefty experiment with large language models that cost me just north of $1 million in a single month. I wanted to share my experience in hopes it…
Hey everyone, I recently switched from using ZephyrCode Pro to OpenLogic AI and faced an unexpected issue. After a few refreshing months with ZephyrCode's advanced plan, I decided…
Hey everyone, I’ve been diving deep into utilizing large language models (LLMs) like GPT-3 for a series of projects, primarily focused on text generation and natural language unde…
Hello everyone, I'm Jake, working on a new project called CodePaperHub. After witnessing the decline of resources like the beloved PapersWithCode post-acquisition, I was driven to…
I recently embarked on a project to integrate OpenAI's ChatGPT with financial tools, and I decided to use Plaid for connecting to bank accounts. Initially, I was a bit skeptical ab…
Hello fellow developers! I've been diving deep into the world of Large Language Models (LLMs) and wanted to share some lessons learned about managing costs effectively. Working wit…
Hey folks, I recently implemented a Retrieval-Augmented Generation (RAG) pipeline and I'm trying to get a clearer idea of where the costs are piling up. Here's a breakdown of my st…
A place for developers building with LLMs to share insights about AI cost optimization, architecture patterns, and best practices.
6,412
70,258
381,666
203
Join the conversation
Sign in to post, vote, comment, and connect with other developers.
Create a custom drag-and-drop report for any GitHub repo with AI usage.