pgml chat blog #914

Conversation
Really cool post! Kind of an eye opener for me for this use case.
> With its knowledge base in place, now the chatbot links to models that allow natural conversations:
>
> - Based on users' questions, querying the indexed chunks to rapidly pull the most relevant passages.
Should this be a numeric list?
> !!!
>
> 3. Copy the template file to `.env`
Could use `cp .env.template .env`.
> !!! code_block
>
> `` ```bash ``
@chillenberger I'm struggling to remember why we need three levels of nesting in markdown, to represent every single code block. Seems like the default style for triple backticks should "handle it".
🚀
> # Introduction
>
> Language models like GPT-3 seem really intelligent at first, but they have a huge blindspot - no external knowledge or memory. Ask them about current events or niche topics and they just can't keep up. To be truly useful in real applications, these large language models (LLMs) need knowledge added to them somehow. The trick is getting them that knowledge fast enough to have natural conversations. Open source tools like LangChain try to help by giving language models more context and knowledge. But they end up glueing together different services into a complex patchwork. This leads to a lot of infrastructure overhead, maintenance needs, and slow response times that hurt chatbot performance. We need a better solution tailored specifically for chatbots to inject knowledge in a way that's fast, relevant and integrated.
Suggested change (GPT-3 → GPT-4; mention LlamaIndex alongside LangChain):

> Language models like GPT-4 seem really intelligent at first, but they have a huge blindspot - no external knowledge or memory. Ask them about current events or niche topics and they just can't keep up. To be truly useful in real applications, these large language models (LLMs) need knowledge added to them somehow. The trick is getting them that knowledge fast enough to have natural conversations. Open source tools like LangChain and LlamaIndex try to help by giving language models more context and knowledge. But they end up glueing together different services into a complex patchwork. This leads to a lot of infrastructure overhead, maintenance needs, and slow response times that hurt chatbot performance. We need a better solution tailored specifically for chatbots to inject knowledge in a way that's fast, relevant and integrated.
> In the first part of this blog series, we will talk about deploying a chatbot using `pgml-chat` command line tool. In the second part, we will show how `pgml-chat` works under the hood and focus on achieving low-latencies.
Suggested change:

> In the first part of this blog series, we will talk about deploying a chatbot using the `pgml-chat` command line tool. In the second part, we will show how `pgml-chat` works under the hood and focus on achieving low-latencies.
> 2. Passing those passages to a model like GPT-3 to generate conversational responses.
> 3. Orchestrating the query, retrieval and generation flow to enable real-time chat.
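The query → retrieve → generate flow described in the quoted list can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not pgml-chat's actual implementation: the in-memory chunk list and keyword-overlap scoring are toy stand-ins for the embedding search pgml-chat runs inside Postgres, and the names `retrieve` and `build_prompt` are hypothetical.

```python
# Toy sketch of the chatbot flow: retrieve relevant chunks, then build a
# context-augmented prompt to hand to an LLM like GPT-3/GPT-4.

def retrieve(question: str, chunks: list[str], k: int = 2) -> list[str]:
    """Rank indexed chunks by naive keyword overlap with the question.
    (A real system ranks by vector similarity over embeddings instead.)"""
    q_words = set(question.lower().split())
    scored = sorted(chunks, key=lambda c: -len(q_words & set(c.lower().split())))
    return scored[:k]

def build_prompt(question: str, passages: list[str]) -> str:
    """Assemble the prompt that would be sent to the generation model."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using this context:\n{context}\n\nQuestion: {question}"

# Tiny stand-in knowledge base (three "indexed chunks").
chunks = [
    "pgml-chat indexes documents as chunks in Postgres",
    "GPT-4 generates conversational responses",
    "bananas are yellow",
]
passages = retrieve("How does pgml-chat index documents", chunks)
prompt = build_prompt("How does pgml-chat index documents", passages)
```

In the real tool, step 3 (orchestration) amounts to running this loop per user message, with retrieval and generation both served from the database to keep latency low.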
> ## 3. Evaluating and Fine-tuning chatbot
Suggested change:

> ## 3. Evaluating and Fine-tuning the chatbot
> Chatbot needs to be evaluated and fine-tuned before it can be deployed to the real world. This involves:
Suggested change:

> The chatbot needs to be evaluated and fine-tuned before it can be deployed to the real world. This involves: