0% found this document useful (0 votes)

112 views12 pages

DSPy a Framework for Programming With LLMs

Uploaded by

nishchay.mahor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

112 views12 pages

DSPy a Framework for Programming With LLMs

Uploaded by

nishchay.mahor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

DSPy: A Framework for

Programming with LLMs

Authored By

Rohit Sroch
Sr. AI Scientist at AI Labs
C5i
The rapid evolution of artificial intelligence, particularly with the emergence of LLMs, has spurred
the development of several frameworks focused on enhancing interactions with these large
language models, marking a significant technological advancement.

While numerous frameworks concentrate on prompting LLMs, DSPy is a frontrunner in this

advancement, introducing a novel approach to programming LLMs. This article delves deep into
the intricacies of DSPy, analyzing its programming framework, comparing it with other
frameworks, and offering practical demonstrations to highlight its capabilities.

Introduction to DSPy
DSPy, which stands for Declarative Sequencing Python framework, signifies a fundamental
change in how developers engage with LLMs. Conventional approaches involve manually creating
prompts, a task prone to being time-consuming and inaccurate.

The DSPy framework is designed to offer building blocks for developing LLM applications through
programming, similar to frameworks like PyTorch, in which we build neural networks by selecting
required network layers and optimizers to minimize training loss for specific metrics we aim to
optimize and learn model parameters.

But programming with foundational models follows a distinct paradigm, particularly emphasized
in the context of in-context learning. Within this paradigm, we identify tasks, write instructions
using natural language prompts, adjust the wording, give examples as few-shot of desired model
outcomes, provide the right context, and refine as needed.

Challenges faced with prompting LLMs:

1. Complexity of Breaking Down Problems: The process of breaking down problems into
isolated steps and effectively prompting an LLM for each step can be complex and
time-consuming.
2. Integration and Cohesion of Steps: Once individual steps are established, integrating, and
ensuring the smooth operation of these steps together can present challenges, requiring
careful tweaking and adjustments.

3. Manual Identification of Few-shot Examples: The manual identification of few-shot

examples to fine-tune each step can be labor-intensive and may require significant
domain expertise.

4. Maintenance and Adaptability: Maintaining prompt coherence becomes challenging when

there are changes in the pipeline, the LLM itself, or the data, potentially necessitating
frequent updates to prompts or fine-tuning steps which is further time-consuming.

© C5i 01
DSPy tackles these challenges by introducing a programmable interface that allows for
algorithmic optimization of prompts and model weights, particularly beneficial when LLMs are
employed multiple times within a pipeline. This enhances the efficiency and effectiveness of
interactions with language models.

Why DSPy? Comparison to other

Frameworks
To appreciate the uniqueness of DSPy, it is essential to understand how it contrasts with similar
technologies:

LangChain and LlamaIndex: LangChain aims to chain language models for application building
while LlamaIndex focuses on improving search capabilities within texts. However, both
programmable approach offers a distinct advantage in precision and adaptability. DSPy’s
niche is in optimizing prompt construction for better interaction with LLMs.

PyTorch: PyTorch is a comprehensive framework for a wide range of deep learning

applications. In analogy, DSPy offers specialized functionality for working with pre-trained
language models, providing general-purpose modules, optimizers, metrics, etc. for developers
looking to harness the power of natural language processing.

DSPy Programming Components

Training Data

DSPy Program

Optimizer & Metric

Write DSPy Program

Specify Evaluation
Logic & Optimizer
LLMs
Metrics
Signatures Compile
Accuracy DSPy Program
Inline
Collect Exact-Match
Training Data Class-Based DSPy Compiler

Optimizer
Modules
LabeledFewShot
Chain-of-thought
BootstrapFewShot
ReAct
Human
Annotator

Iterate

Language Models: The framework provides easy access to several large language models like
AzureOpenAI, GoogleVertexAI, Amazon Bedrock, Claude, OpenAI, Mistral, etc. for building LLM
based applications.

Signatures: A declarative specification that outlines the input/output behavior of a DSPy

module which specifies LLM what it needs to do rather than how to do it. While defining
signatures, we provide descriptions of fields, and these field names hold semantic
significance, forming the basis for constructing prompts.

The framework provides two ways to define signature:

• Inline Signature: Defined as a short string for common tasks like question answering,
sentiment classification, etc.

• Class-based Signature: Defined as a custom class for advanced tasks that need more
verbose signature.

Modules: This constitutes the core aspect of the DSPy program responsible for managing
flow logic. DSPy offers pre-built modules for fundamental tasks such as Predict, Chain of
Thought, ReAct, etc. Moreover, we also have the option to craft custom modules and combine
multiple ones as needed.

Metrics: The framework provides common metrics like (accuracy, exact match, F1-score,
etc.) for evaluation and optimization. We can define a custom metric as it is a function that
will take examples from your data, the output of your system and return a score that
quantifies how good the output is.

Optimizers (formerly Teleprompters): Optimizers automatically tune prompts (by adding

examples through the selection procedure) and model parameters (like temperature). Also,
optimizers assess performance based on the metric being optimized. The framework offers
several optimizers like, LabeledFewShot and BootstrapFewShot, etc.

Compiler: The module's instructions are optimized to obtain relevant and efficient examples
for the task at hand. The compiled program can then be saved to disk and reloaded,
functioning similarly to checkpoints.

Note that DSPy is an open-source framework with an active community that is continuously
evolving in terms of more components, further reducing the time/effort required to build LLM
applications.

© C5i 03
Text to SQL Implementation using DSPy
Let’s consider building a basic LLM application that allows users to ask natural questions
and generate an SQL query based on the provided database schema.

Using Langchain framework

a) Define the access to AzureOpenAI LLM (GPT4)

c) Define a basic chain

Using DSPy framework

a) Define the access to AzureOpenAI LLM (GPT4)

c) Define the custom Text to SQL module

© C5i 07
d) Define the optimizer & evaluation metric and load labeled data, which will be used to identify
the few shot examples algorithmically.

e) Compile the DSPy program, which algorithmically tunes the prompt by identifying the best
few-shot examples

The above shows how we can use the Langchain or DSPy framework to implement Text to SQL
tasks. In contrast, DSPy provides the following advantages:
DSPy framework leverages optimizers to algorithmically tune the input prompt using
provided labeled data, significantly reducing the manual effort required for prompt
optimization compared to Langchain.

Unlike Langchain, which requires manual prompt optimization to align the model with the
desired output, DSPy can algorithmically identify the best few-shot examples, enhancing
accuracy without manual intervention.

DSPy framework includes a compiler that seamlessly recompiles the entire pipeline in
response to any changes in the LLM version or type. In contrast, Langchain necessitates
manual prompt retuning when such changes occur.

DSPy's automated processes contrast Langchain's manual methods, making it more efficient
and less labor-intensive to achieve desired model outputs and adapt to model variations.

© C5i 09
Conclusion
DSPy enables future interaction with LLMs by allowing
users to build LLM applications time-efficiently by
programming (not prompting) with LLMs. Instead of
focusing on hand-crafted prompts that target specific
applications, DSPy has general-purpose modules that
learn to prompt (or finetune) a language model. This
approach to programming LLMs fixes the problem of
too-fragile prompts, as you can recompile the pipeline
when making changes to your code, data, assumptions,
or metric. DSPy will then automatically create new
effective prompts that fit your changes.

References
https://dspy-docs.vercel.app/docs/intro
https://github.com/stanfordnlp/dspy
https://arxiv.org/pdf/2310.03714
https://python.langchain.com/v0.1/docs/get_started/introduction/
https://docs.llamaindex.ai/en/stable/
https://pytorch.org/docs/stable/index.html

© C5i 10
About Us
C5i is a pure-play AI & Analytics provider that combines the
power of human perspective with AI technology to deliver
trustworthy intelligence. The company drives value through a
comprehensive solution set, integrating multifunctional teams
that have technical and business domain expertise with a robust suite of
products, solutions, and accelerators tailored for various horizontal and
industry-specific use cases. At the core, C5i’s focus is to deliver business
impact at speed and scale by driving adoption of AI-assisted decision-making.

C5i caters to some of the world’s largest enterprises, including many Fortune
500 companies. The company’s clients span Technology, Media, and Telecom
(TMT), Pharma & Lifesciences, CPG, Retail, Banking, and other sectors. C5i has
been recognized by leading industry analysts like Gartner and Forrester for its
Analytics and AI capabilities and proprietary AI-based platforms.

www.c5i.ai

Transforming Conversational AI: Exploring The Power of Large Language Models in Interactive Conversational Agents 1st Edition Michael Mctear
100% (7)
Transforming Conversational AI: Exploring The Power of Large Language Models in Interactive Conversational Agents 1st Edition Michael Mctear
62 pages
Michael Miller - Absolute Beginner's Guide Computer Basics, Windows 11 Edition, 10th Edition-Que Publishing - Pearson Education (2023)
100% (3)
Michael Miller - Absolute Beginner's Guide Computer Basics, Windows 11 Edition, 10th Edition-Que Publishing - Pearson Education (2023)
551 pages
PwC Presentations Best Practices, PPT Hehe Presentation Slide Deck, Excel, Word, & PD
No ratings yet
PwC Presentations Best Practices, PPT Hehe Presentation Slide Deck, Excel, Word, & PD
23 pages
GitHub - iptv-org_awesome-iptv_ A curated list of resources rela
No ratings yet
GitHub - iptv-org_awesome-iptv_ A curated list of resources rela
9 pages
Capacitor Filter
100% (2)
Capacitor Filter
21 pages
RW E-Jet User Manual V20-2-4 Rottweil
No ratings yet
RW E-Jet User Manual V20-2-4 Rottweil
107 pages
Multiple Project Tracking Template Excelx Year2025
No ratings yet
Multiple Project Tracking Template Excelx Year2025
28 pages
Data Science Guide
100% (1)
Data Science Guide
275 pages
Dokumen - Pub Algorithm Design Techniques
No ratings yet
Dokumen - Pub Algorithm Design Techniques
555 pages
Falancs User en PDF
No ratings yet
Falancs User en PDF
784 pages
Cornell 24may24 Quantum Computers
No ratings yet
Cornell 24may24 Quantum Computers
143 pages
Msazure - Create Your Own GenAI Apps
No ratings yet
Msazure - Create Your Own GenAI Apps
30 pages
Code Generation With LLMs
No ratings yet
Code Generation With LLMs
59 pages
(eBook PDF) Hands-On Machine Learning with Scikit-Learn and TensorFlow download pdf
100% (9)
(eBook PDF) Hands-On Machine Learning with Scikit-Learn and TensorFlow download pdf
45 pages
PMO_1737215480
No ratings yet
PMO_1737215480
51 pages
AI Curricullum of Outskills
No ratings yet
AI Curricullum of Outskills
5 pages
ai agents
No ratings yet
ai agents
13 pages
ChatGPT-Ebook-4th-edition
No ratings yet
ChatGPT-Ebook-4th-edition
109 pages
Building AI Agents
No ratings yet
Building AI Agents
3 pages
AI Privacy Risks AI
100% (1)
AI Privacy Risks AI
107 pages
Automotive Ethernet Black Book
100% (1)
Automotive Ethernet Black Book
44 pages
AI-Powered Automated Web Development System
No ratings yet
AI-Powered Automated Web Development System
6 pages
PythonAI LLMs ForSharing
No ratings yet
PythonAI LLMs ForSharing
47 pages
IMPLEMENTATION_OF_GENERATIVE_A (1)
No ratings yet
IMPLEMENTATION_OF_GENERATIVE_A (1)
13 pages
_OceanofPDF.com_LLMs_in_Enterprise_-_Ahmed_Menshawy
No ratings yet
_OceanofPDF.com_LLMs_in_Enterprise_-_Ahmed_Menshawy
194 pages
Building Acoustic Suspension Subwoofer Boxes
0% (1)
Building Acoustic Suspension Subwoofer Boxes
12 pages
Artificial Intelligence Tools (1)
No ratings yet
Artificial Intelligence Tools (1)
23 pages
DM14 Visualisation
100% (1)
DM14 Visualisation
67 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
91 pages
AI Projects
100% (1)
AI Projects
5 pages
AI and Its Impact To The Future of Work - PRE ICP IBS 2024 V01
No ratings yet
AI and Its Impact To The Future of Work - PRE ICP IBS 2024 V01
32 pages
OceanofPDF.com Machine Learning for Beginners Complete a - Declan Mellor
No ratings yet
OceanofPDF.com Machine Learning for Beginners Complete a - Declan Mellor
102 pages
Natural Language Processing
No ratings yet
Natural Language Processing
12 pages
NLP - Natural Language Processing
No ratings yet
NLP - Natural Language Processing
74 pages
Corsen Builders: Rehabilitation Project
No ratings yet
Corsen Builders: Rehabilitation Project
26 pages
Sinan Ozdemir - Quick Start Guide to Large Language Models, Second Edition-Addison-Wesley (2024)
No ratings yet
Sinan Ozdemir - Quick Start Guide to Large Language Models, Second Edition-Addison-Wesley (2024)
279 pages
Power Systems Consulting: Efficient Grids For Electric Utilities and Industrial Companies
No ratings yet
Power Systems Consulting: Efficient Grids For Electric Utilities and Industrial Companies
41 pages
LLM Monitoring and Observability - A Summary of Techniques and Approaches For Responsible AI - by Josh Poduska - Towards Data Science
No ratings yet
LLM Monitoring and Observability - A Summary of Techniques and Approaches For Responsible AI - by Josh Poduska - Towards Data Science
12 pages
Generative AI Course Content Simply Learn
No ratings yet
Generative AI Course Content Simply Learn
36 pages
Lecture # 1-2 Introduction To Gen AI
No ratings yet
Lecture # 1-2 Introduction To Gen AI
41 pages
Global Consultant Bootcamp Starter Kit
No ratings yet
Global Consultant Bootcamp Starter Kit
11 pages
Building Intelligent Agents with Semantic Kernel: A Comprehensive Guide
No ratings yet
Building Intelligent Agents with Semantic Kernel: A Comprehensive Guide
16 pages
6 Scale Smile Rating Scale Infographics
No ratings yet
6 Scale Smile Rating Scale Infographics
20 pages
Learner Guide Troubleshooting HP Networks 1041 No Watermark
No ratings yet
Learner Guide Troubleshooting HP Networks 1041 No Watermark
108 pages
A Review On Large Language Models Architectures Ap
No ratings yet
A Review On Large Language Models Architectures Ap
31 pages
Natural Language Processing
No ratings yet
Natural Language Processing
21 pages
ML Lab Programs (1-12)
No ratings yet
ML Lab Programs (1-12)
35 pages
DSP Lab Expt 1 Using Function - 3-13
No ratings yet
DSP Lab Expt 1 Using Function - 3-13
11 pages
Build Dont Talk - Raj Shamani
100% (27)
Build Dont Talk - Raj Shamani
178 pages
GenAI Pinnacle Roadmap
100% (1)
GenAI Pinnacle Roadmap
8 pages
Building Your Own Autonomous LLM Agents - LinkedIn
No ratings yet
Building Your Own Autonomous LLM Agents - LinkedIn
33 pages
AI Engineer Roadmap
No ratings yet
AI Engineer Roadmap
22 pages
Sertifikat New
No ratings yet
Sertifikat New
7 pages
SAS Enterprise Guide Project For Editing and Imputation
No ratings yet
SAS Enterprise Guide Project For Editing and Imputation
10 pages
Team Deloitte - MyApp - Project
No ratings yet
Team Deloitte - MyApp - Project
34 pages
Deltek Web Services Programmer's Guide
No ratings yet
Deltek Web Services Programmer's Guide
67 pages
FLeX Controller
No ratings yet
FLeX Controller
2 pages
Data Ready Ai
No ratings yet
Data Ready Ai
8 pages
Generative AI
100% (1)
Generative AI
4 pages
Understanding Large Language Models: Learning Their Underlying Concepts and Technologies 1st Edition Thimira Amaratunga - Download the full set of chapters carefully compiled
100% (1)
Understanding Large Language Models: Learning Their Underlying Concepts and Technologies 1st Edition Thimira Amaratunga - Download the full set of chapters carefully compiled
55 pages
Universal Orlando Resort Website Reference Guide
No ratings yet
Universal Orlando Resort Website Reference Guide
5 pages
AI and Automation in Business
No ratings yet
AI and Automation in Business
10 pages
PHD Proposal Heriot P&A
No ratings yet
PHD Proposal Heriot P&A
3 pages
MLOps Buyers Guide by Seldon
No ratings yet
MLOps Buyers Guide by Seldon
11 pages
Manufacturing: Sic Power Electronics For Variable Frequency Motor Drives
No ratings yet
Manufacturing: Sic Power Electronics For Variable Frequency Motor Drives
5 pages
Computer Networks Chapter 6 Multimedia Networking Notes
No ratings yet
Computer Networks Chapter 6 Multimedia Networking Notes
3 pages
KNL4343 Lecture10
No ratings yet
KNL4343 Lecture10
21 pages
Machine Learning Is Fun 1565131730
No ratings yet
Machine Learning Is Fun 1565131730
48 pages
Sector:: Automotive/Land Transport Sector
No ratings yet
Sector:: Automotive/Land Transport Sector
20 pages
Intel GenAI Hackathon
No ratings yet
Intel GenAI Hackathon
10 pages
CRP Walmart & Procter and Gambel
No ratings yet
CRP Walmart & Procter and Gambel
8 pages
Make 200 in 24H
No ratings yet
Make 200 in 24H
6 pages
Full Life Planner Interactive
100% (76)
Full Life Planner Interactive
265 pages
Assessment Report
No ratings yet
Assessment Report
25 pages
Beyond AI
100% (7)
Beyond AI
532 pages
Langchain PDF Reader
100% (1)
Langchain PDF Reader
15 pages
The 100+ Business Models by FourWeekMBA - Full Library
97% (35)
The 100+ Business Models by FourWeekMBA - Full Library
780 pages
Oracle Generative AI Services
No ratings yet
Oracle Generative AI Services
17 pages
1732760688350
No ratings yet
1732760688350
29 pages
Six Week-Total Handson Internship Program On Machine Learning
No ratings yet
Six Week-Total Handson Internship Program On Machine Learning
8 pages
226 ChatGPT Prompts A-Z ChatGPT Prompt Engineering BootCamp
93% (15)
226 ChatGPT Prompts A-Z ChatGPT Prompt Engineering BootCamp
120 pages
HW4 Text-1
No ratings yet
HW4 Text-1
8 pages
Better Data Visualizations Scholars
98% (41)
Better Data Visualizations Scholars
464 pages
Sprout Generative AI Report - FINAL
No ratings yet
Sprout Generative AI Report - FINAL
12 pages
List of IEC Standards
No ratings yet
List of IEC Standards
8 pages
Computer Studies: Paper 7010/11 Paper 11
No ratings yet
Computer Studies: Paper 7010/11 Paper 11
16 pages
The Art of Asking ChatGPT For High-Quality Answers A Complete Guide To Prompt Engineering Techniques (Ibrahim John) (Z-Library)
96% (27)
The Art of Asking ChatGPT For High-Quality Answers A Complete Guide To Prompt Engineering Techniques (Ibrahim John) (Z-Library)
52 pages
Limitless
97% (148)
Limitless
355 pages
AI Artificial Intelligence, 60 Leaders 17 Questions
100% (12)
AI Artificial Intelligence, 60 Leaders 17 Questions
236 pages
How To Talk To Anyone About Anything Improve Your Social Skills Master Small Talk
92% (48)
How To Talk To Anyone About Anything Improve Your Social Skills Master Small Talk
103 pages
(EARLY RELEASE) Quick Start Guide To Large Language Models Strategies and Best Practices For Using ChatGPT and Other LLMs (Sinan Ozdemir) (Z-Library)
100% (14)
(EARLY RELEASE) Quick Start Guide To Large Language Models Strategies and Best Practices For Using ChatGPT and Other LLMs (Sinan Ozdemir) (Z-Library)
132 pages
Unlocking The Potential of ChatGPT
100% (18)
Unlocking The Potential of ChatGPT
45 pages
Data Analytics Concepts Techniques and A PDF
100% (11)
Data Analytics Concepts Techniques and A PDF
451 pages
Lets Learn AI Base Module PDF
86% (14)
Lets Learn AI Base Module PDF
196 pages
Power of Ignored Skills Change The Way You Think and Decide (Manoj Tripathi) (Z-Library)
94% (17)
Power of Ignored Skills Change The Way You Think and Decide (Manoj Tripathi) (Z-Library)
118 pages
Prompt Engineer 101
97% (29)
Prompt Engineer 101
45 pages
Manage Your Day To Day
99% (101)
Manage Your Day To Day
120 pages
Start Now. Get Perfect Later (2018)
100% (26)
Start Now. Get Perfect Later (2018)
240 pages
GenAI_Interview_Questions-Draft
No ratings yet
GenAI_Interview_Questions-Draft
27 pages
AI Agent Index
No ratings yet
AI Agent Index
15 pages
Assignment Nptel
No ratings yet
Assignment Nptel
5 pages
Intern - Gen AI
No ratings yet
Intern - Gen AI
2 pages
Python Programming. A Step-by-Step Guide For Absolute Beginners
93% (43)
Python Programming. A Step-by-Step Guide For Absolute Beginners
181 pages
Top 100 Applications of Generative AI 1683282083
100% (14)
Top 100 Applications of Generative AI 1683282083
119 pages
70 AI Tools To Boost Productivity
83% (24)
70 AI Tools To Boost Productivity
72 pages
Data For GenAI
No ratings yet
Data For GenAI
17 pages
SASMO WORKSHEET April 8
No ratings yet
SASMO WORKSHEET April 8
2 pages
RAG Architecture
100% (7)
RAG Architecture
52 pages
Top Agentic AI Architecture Design Patterns
100% (3)
Top Agentic AI Architecture Design Patterns
8 pages
10WaysToChangeYourMindset PDF
97% (30)
10WaysToChangeYourMindset PDF
7 pages
10 Most Asked LLM Interview Questions
No ratings yet
10 Most Asked LLM Interview Questions
12 pages
ChatGPT Cheat Sheet - DataCamp PDF
91% (11)
ChatGPT Cheat Sheet - DataCamp PDF
78 pages
Diagrammatic Reasoning in AI
100% (6)
Diagrammatic Reasoning in AI
347 pages
The McKinsey Engagement Summary
95% (19)
The McKinsey Engagement Summary
82 pages
Generative AI Checklist
100% (1)
Generative AI Checklist
10 pages
RAG Technics
No ratings yet
RAG Technics
8 pages
Generative AI Usecases - A Comprehensive Guide - Dummies
100% (1)
Generative AI Usecases - A Comprehensive Guide - Dummies
19 pages
Python Machine Learning for Beginners: A Step by Step Approach to Scikit-Learn and TensorFlow
From Everand
Python Machine Learning for Beginners: A Step by Step Approach to Scikit-Learn and TensorFlow
Lena Neill
No ratings yet
PfMP Exam Insights : Q&A with Explanations
From Everand
PfMP Exam Insights : Q&A with Explanations
SUJAN
No ratings yet
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet