Pgai Brings Your ML Workload To The Database
Written by Nikos Vaggalis   
Monday, 15 July 2024

Extensions like pgai are targeted at the "AI Engineers", a new breed of developers who unlike researchers are concerned with practically applying AI (models, tools, and APIs) to build software.

These engineers don't want to leave their favorite tools for other tools dedicated to AI. Replace "tool" with Postgres and you'll see where this is going.

In "Azure AI And Pgvector Run Generative AI Directly On Postgres" we witnessed the exact case in point. The Azure AI extension enables the database to call into various Azure AI services like Azure OpenAI.

In short, what this means is that you can generate text embeddings by using SQL queries to call into both Azure OpenAI services without needing a separate application layer. It's all about making the developer's life easy by shifting the Machine Learning workload onto the database. That is, you provided with the ability to run ML models on your live data, without leaving the database.

The new pgai open-source extension by Timescale does that exactly. It brings embedding and generation models closer to the database, making PostgreSQL an even better database for AI applications, catering for the needs of AI Engineers as stated in its official announcement:

We spoke to a Timescale customer who was tasked with building a RAG application at their company. This engineer had decades of database and C experience but complained to us about fumbling around trying to learn Python to quickly build a proof-of-concept (PoC) to validate the core idea. “I wish I could do more of this in SQL” was a common refrain we heard from this and many other engineers we spoke to, ultimately sparking the idea for pgai.

Wish granted!

With pgai, you can now do the following directly from within PostgreSQL in a SQL query:

  • Create embeddings for your data.
  • Retrieve LLM chat completions from models like OpenAI GPT4o and Llama 3.
  • Generate responses for models such as Ollama.
  • Reason over your data and facilitate use cases like classification, summarization, and data enrichment on your existing relational data in PostgreSQL.

For instance, doing Chat completion to return the content as text from a specific message :

The data returned looks like:

Many examples to get you started can be found on the extension's Github repo, like using pgai with OpenAI.

The initial release of pgai supports creating OpenAI embeddings and getting OpenAI chat completions from models like GPT4o directly from your PostgreSQL database. The plan is to add support for more models in the future such as Claude and Cohere.

At this point I'd like to note down that there's another extension, PostgresML, which shares a very similar if not the same purpose. As we examined in which we examined in "PostgresML - Bring Your ML Workload To The Database", PostgresML is a machine learning extension for PostgreSQL that enables you to perform training and inference on text and tabular data using SQL queries. So make sure to check that article out to figure out which of the two extensions fits your purpose best.

The gist, however, is that with extensions like this you can utilize PostgreSQL for AI as well without going out of your way. AI Engineers rejoice!

 

More Information

pgai Github

Pgai: Giving PostgreSQL Developers AI Engineering Superpowers
 

Related Articles

PostgresML - Bring Your ML Workload To The Database

Azure AI And Pgvector Run Generative AI Directly On Postgres

 

 

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


OpenAI Releases Swarm
25/10/2024

OpenAI has released an experimental educational framework for exploring ergonomic, lightweight multi-agent orchestration. Swarm is managed by the OpenAI Solution team, but is not intended to be used i [ ... ]



Microsoft Releases Dev Proxy 0.21
08/10/2024

Microsoft has released an update to Dev Proxy, its command-line API simulator. The updated version, v0.21, adds the ability to simulate authentication and authorization using API keys and OAuth2 among [ ... ]


More News

espbook

 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Monday, 15 July 2024 )