Commit 6826c22

Montana/nl (#1382)
1 parent 85c078d commit 6826c22

6 files changed: 2 additions and 21 deletions

pgml-cms/blog/announcing-support-for-aws-us-east-1-region.md

Lines changed: 0 additions & 6 deletions
@@ -27,14 +27,8 @@ To demonstrate the impact of moving the data closer to your application, we've c
 
 <figure><img src=".gitbook/assets/image (8).png" alt=""><figcaption></figcaption></figure>
 
-\
-
-
 <figure><img src=".gitbook/assets/image (9).png" alt=""><figcaption></figcaption></figure>
 
-\
-
-
 ## Using the New Region
 
 To take advantage of latency savings, you can [deploy a dedicated PostgresML database](https://postgresml.org/signup) in `us-east-1` today. We make it as simple as filling out a very short form and clicking "Create database".

pgml-cms/blog/generating-llm-embeddings-with-open-source-models-in-postgresml.md

Lines changed: 1 addition & 4 deletions
@@ -18,7 +18,7 @@ Montana Low
 
 April 21, 2023
 
-PostgresML makes it easy to generate embeddings from text in your database using a large selection of state-of-the-art models with one simple call to **`pgml.embed`**`(model_name, text)`. Prove the results in this series to your own satisfaction, for free, by signing up for a GPU accelerated database.
+PostgresML makes it easy to generate embeddings from text in your database using a large selection of state-of-the-art models with one simple call to `pgml.embed(model_name, text)`. Prove the results in this series to your own satisfaction, for free, by signing up for a GPU accelerated database.
 
 This article is the first in a multipart series that will show you how to build a post-modern semantic search and recommendation engine, including personalization, using open source models.
 
@@ -216,9 +216,6 @@ For comparison, it would cost about $299 to use OpenAI's cheapest embedding mode
 | GPU | 17ms | $72 | 6 hours |
 | OpenAI | 300ms | $299 | millennia |
 
-\
-
-
 You can also find embedding models that outperform OpenAI's `text-embedding-ada-002` model across many different tests on the [leaderboard](https://huggingface.co/spaces/mteb/leaderboard). It's always best to do your own benchmarking with your data, models, and hardware to find the best fit for your use case.
 
 > _HTTP requests to a different datacenter cost more time and money for lower reliability than co-located compute and storage._

pgml-cms/blog/meet-us-at-the-2024-postgres-conference.md

Lines changed: 0 additions & 3 deletions
@@ -22,7 +22,6 @@ Why should you care? It's not every day you get to dive headfirst into the world
 Save 25% on your ticket with our discount code: 2024\_POSTGRESML\_25
 {% endhint %}
 
-\
 PostgresML CEO and founder, Montana Low, will kick off the event on April 17th with a keynote about navigating the confluence of hardware evolution and machine learning technology.&#x20;
 
 We’ll also be hosting a masterclass in retrieval augmented generation (RAG) on April 18th. Our own Silas Marvin will give hands-on guidance to equip you with the ability to implement RAG directly within your database.&#x20;
@@ -36,5 +35,3 @@ If you’d like some 1:1 time with our team at PgConf [contact us here](https://
 So, why sit on the sidelines when you could be right in the thick of it, soaking up knowledge, making connections, and maybe even stumbling upon your next big breakthrough? Clear your schedule, grab your ticket, and get ready to geek out with us in San Jose.
 
 See you there!
-
-\

pgml-cms/blog/mindsdb-vs-postgresml.md

Lines changed: 0 additions & 6 deletions
@@ -47,9 +47,6 @@ Both Projects integrate several dozen machine learning algorithms, including the
 | Full Text Search | - ||
 | Geospatial Search | - ||
 
-\
-
-
 Both MindsDB and PostgresML support many classical machine learning algorithms to do classification and regression. They are both able to load ~~the latest LLMs~~ some models from Hugging Face, supported by underlying implementations in libtorch. I had to cross that out after exploring all the caveats in the MindsDB implementations. PostgresML supports the models released immediately as long as underlying dependencies are met. MindsDB has to release an update to support any new models, and their current model support is extremely limited. New algorithms, tasks, and models are constantly released, so it's worth checking the documentation for the latest list.
 
 Another difference is that PostgresML also supports embedding models, and closely integrates them with vector search inside the database, which is well beyond the scope of MindsDB, since it's not a database at all. PostgresML has direct access to all the functionality provided by other Postgres extensions, like vector indexes from [pgvector](https://github.com/pgvector/pgvector) to perform efficient KNN & ANN vector recall, or [PostGIS](http://postgis.net/) for geospatial information as well as built in full text search. Multiple algorithms and extensions can be combined in compound queries to build state-of-the-art systems, like search and recommendations or fraud detection that generate an end to end result with a single query, something that might take a dozen different machine learning models and microservices in a more traditional architecture.
@@ -300,9 +297,6 @@ PostgresML is the clear winner in terms of performance. It seems to me that it c
 | translation\_en\_to\_es | t5-base | 1573 | 1148 | 294 |
 | summarization | sshleifer/distilbart-cnn-12-6 | 4289 | 3450 | 479 |
 
-\
-
-
 There is a general trend, the larger and slower the model is, the more work is spent inside libtorch, the less the performance of the rest matters, but for interactive models and use cases there is a significant difference. We've tried to cover the most generous use case we could between these two. If we were to compare XGBoost or other classical algorithms, that can have sub millisecond prediction times in PostgresML, the 20ms Python service overhead of MindsDB just to parse the incoming query would be hundreds of times slower.
 
 ## Clouds

pgml-cms/docs/api/apis.md

Lines changed: 1 addition & 1 deletion
@@ -11,7 +11,7 @@ We also provide Client SDKs that implement the best practices on top of the SQL
 ## SQL Extension
 
 PostgreSQL is designed to be _**extensible**_. This has created a rich open-source ecosystem of additional functionality built around the core project. Some [extensions](https://www.postgresql.org/docs/current/contrib.html) are include in the base Postgres distribution, but others are also available via the [PostgreSQL Extension Network](https://pgxn.org/).\
-\
+
 There are 2 foundational extensions included in a PostgresML deployment that provide functionality inside the database through SQL APIs.
 
 * **pgml** - provides Machine Learning and Artificial Intelligence APIs with access to more than 50 ML algorithms to train classification, clustering and regression models on your own data, or you can perform dozens of tasks with thousands of models downloaded from HuggingFace.

pgml-cms/docs/resources/benchmarks/mindsdb-vs-postgresml.md

Lines changed: 0 additions & 1 deletion
@@ -35,7 +35,6 @@ Both Projects integrate several dozen machine learning algorithms, including the
 | Full Text Search | - ||
 | Geospatial Search | - ||
 
-\
 Both MindsDB and PostgresML support many classical machine learning algorithms to do classification and regression. They are both able to load ~~the latest LLMs~~ some models from Hugging Face, supported by underlying implementations in libtorch. I had to cross that out after exploring all the caveats in the MindsDB implementations. PostgresML supports the models released immediately as long as underlying dependencies are met. MindsDB has to release an update to support any new models, and their current model support is extremely limited. New algorithms, tasks, and models are constantly released, so it's worth checking the documentation for the latest list.
 
 Another difference is that PostgresML also supports embedding models, and closely integrates them with vector search inside the database, which is well beyond the scope of MindsDB, since it's not a database at all. PostgresML has direct access to all the functionality provided by other Postgres extensions, like vector indexes from [pgvector](https://github.com/pgvector/pgvector) to perform efficient KNN & ANN vector recall, or [PostGIS](http://postgis.net/) for geospatial information as well as built in full text search. Multiple algorithms and extensions can be combined in compound queries to build state-of-the-art systems, like search and recommendations or fraud detection that generate an end to end result with a single query, something that might take a dozen different machine learning models and microservices in a more traditional architecture.
