WIP: DocsGPT POC#12056

kiwicopple · 2023-01-30T16:30:51Z

(copied from OP: #12054)

End-to-end POC for DocsGPT project.

Pre-reqs

Config/secrets

Updated your ./apps/docs/.env.local file with some keys (use .env.sample as a base):

NEXT_PUBLIC_SUPABASE_URL=http://localhost:54321
NEXT_PUBLIC_SUPABASE_ANON_KEY=
OPENAI_KEY=

# TODO: merge META_SUPABASE_URL with NEXT_PUBLIC_SUPABASE_URL
# Currently separate in order to run edge function locally
# (runs in container with different network namespace)
META_SUPABASE_URL=http://supabase_kong_supabase:8000
META_SUPABASE_SERVICE_KEY=

To get the OpenAI key, you will need an OpenAI account and create a key here:
https://beta.openai.com/account/api-keys

Run local Supabase stack

We have extended ./supabase to include DB migrations required for DocsGPT. Be sure to run a local Supabase stack. From the project root:

$ supabase start

Generate embeddings (first time only)

The first time you will need to pre-generate embeddings for the documents (guide-only for now). Simply call the following script from ./apps/docs:

$ npm run build:embeddings

You can safely call this multiple times if you like - it uses a checksum to determine whether or not it has already generated an embedding for each document and will skip if its already there.

In the future this will most likely be called from a CI pipeline.

Note: This does have a (very small) cost every time you run. It queries OpenAI's embeddings endpoint to generate embeddings. If you find yourself constantly restarting your Postgres instance, you can use the following commands to quickly backup/restore without re-generating embeddings every time:

Backup:

pg_dump --column-inserts --data-only -h localhost -p 54322 -U postgres -t page -t page_section > backup.sql

Restore:

psql -h localhost -p 54322 -U postgres -q -f backup.sql

Run edge function

A server side edge function was built to handle DocGPT queries (search embeddings in Postgres, inject as context in prompt, send prompt request to OpenAI).

You will need to run this function locally and pass in the above environment variables. From the project root:

$ supabase functions serve clippy-search --env-file ./apps/docs/.env

Run docs project

Of course we will need to run the docs project to use the frontend. From ./apps/docs:

$ npm run dev

WIP: DocsGPT POC

vercel · 2023-01-30T16:31:00Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated
zone-www-dot-com	✅ Ready (Inspect)	Visit Preview	💬 Add your feedback	Feb 6, 2023 at 9:19PM (UTC)

4 Ignored Deployments

Name	Status	Preview	Updated
about	⬜️ Ignored (Inspect)		Feb 6, 2023 at 9:19PM (UTC)
docs	⬜️ Ignored (Inspect)	Visit Preview	Feb 6, 2023 at 9:19PM (UTC)
supabase-studio-prod	⬜️ Ignored (Inspect)	Visit Preview	Feb 6, 2023 at 9:19PM (UTC)
supabase-studio-staging	⬜️ Ignored (Inspect)	Visit Preview	Feb 6, 2023 at 9:19PM (UTC)

apps/docs/package.json

apps/docs/scripts/generate-embeddings.ts

supabase/functions/clippy-search/index.ts

apps/docs/scripts/generate-embeddings.ts

… feat/docs-gpt-poc

apps/www/_blog/2023-02-03-openai-embeddings-postgres-vector.mdx

package-lock.json

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

… feat/docs-gpt-poc

github-actions · 2023-02-06T21:19:37Z

apps/www/_blog/2023-02-03-openai-embeddings-postgres-vector.mdx

+
+A new PostgreSQL extension is now available in Supabase: [`pgvector`](https://github.com/pgvector/pgvector), an open-source vector similarity search.
+
+The exponential progress of AI functionality over the past year has inspired many new real world applications. One specific challenge has been the ability to store and query _embeddings_ at scale. 


[prettier] _{reported by reviewdog 🐶}

Suggested change

The exponential progress of AI functionality over the past year has inspired many new real world applications. One specific challenge has been the ability to store and query _embeddings_ at scale.
The exponential progress of AI functionality over the past year has inspired many new real world applications. One specific challenge has been the ability to store and query _embeddings_ at scale.

gregnr and others added 12 commits January 27, 2023 00:21

feat(docs-gpt): mdx parsing and initial db setup

c998d8d

feat(docs-gpt): generate embeddings using openai

bc385b9

chore(docs-gpt): move docs supabase project to repo root

d19a6dd

chore: openai package lock version

ee74045

feat(docs-gpt): search modal ui poc

b8a8c59

feat(docs-gpt): end-to-end gpt-3 completion using context injection

607c981

feat(docs-gpt): markdown formatted responses

bfc2795

feat(docs-gpt): gfm plugin

b441c93

feat(docs-gpt): clippy image

f93e97c

chore(docs-gpt): update env variable names

33c3764

chore(docs-gpt): build embeddings script

b652b11

Merge pull request #12054 from gregnr/feat/docs-gpt-poc

99c720d

WIP: DocsGPT POC

kiwicopple requested a review from a team as a code owner January 30, 2023 16:30

vercel bot deployed to Preview – zone-www-dot-com January 30, 2023 16:33 View deployment

Adds an example env

939d185

vercel bot had a problem deploying to Preview – docs January 30, 2023 16:39 Failure

vercel bot deployed to Preview – supabase-studio-staging January 30, 2023 16:41 View deployment

vercel bot deployed to Preview – supabase-studio-prod January 30, 2023 16:42 View deployment

gregnr reviewed Jan 30, 2023

View reviewed changes

apps/docs/package.json Show resolved Hide resolved

gregnr reviewed Jan 30, 2023

View reviewed changes

apps/docs/scripts/generate-embeddings.ts Show resolved Hide resolved

gregnr reviewed Jan 30, 2023

View reviewed changes

supabase/functions/clippy-search/index.ts Outdated Show resolved Hide resolved

gregnr reviewed Jan 30, 2023

View reviewed changes

apps/docs/scripts/generate-embeddings.ts Show resolved Hide resolved

saltcod added 3 commits January 31, 2023 11:53

Merge branch 'master' into feat/docs-gpt-poc

c7285b8

Remove console.log

176eac3

Handle empty code block

50c426d

vercel bot deployed to Preview – supabase-studio-staging January 31, 2023 18:17 View deployment

vercel bot deployed to Preview – supabase-studio-prod January 31, 2023 18:17 View deployment

vercel bot deployed to Preview – docs January 31, 2023 18:18 View deployment

vercel bot deployed to Preview – zone-www-dot-com January 31, 2023 18:19 View deployment

kiwicopple added 2 commits February 6, 2023 21:10

updates blog post

442d871

Merge branch 'feat/docs-gpt-poc' of github.com:supabase/supabase into…

be3e4b0

… feat/docs-gpt-poc

vercel bot had a problem deploying to Preview – docs February 6, 2023 20:12 Failure

vercel bot deployed to Preview – zone-www-dot-com February 6, 2023 20:13 View deployment

saltcod added 2 commits February 6, 2023 16:49

Move flag

64ee052

Merge branch 'feat/docs-gpt-poc' of github.com:supabase/supabase into…

0c78eaf

… feat/docs-gpt-poc

vercel bot had a problem deploying to Preview – docs February 6, 2023 20:21 Failure

Fix import

d19553f

vercel bot deployed to Preview – docs February 6, 2023 20:29 View deployment

saltcod approved these changes Feb 6, 2023

View reviewed changes

saltcod added 2 commits February 6, 2023 17:13

Merge branch 'master' of github.com:supabase/supabase

df51f2e

Merge branch 'master' into feat/docs-gpt-poc

bdfbb96

github-actions bot reviewed Feb 6, 2023

View reviewed changes

vercel bot deployed to Preview – zone-www-dot-com February 6, 2023 20:48 View deployment

vercel bot deployed to Preview – supabase-studio-staging February 6, 2023 20:52 View deployment

vercel bot deployed to Preview – supabase-studio-prod February 6, 2023 20:53 View deployment

vercel bot deployed to Preview – docs February 6, 2023 20:53 View deployment

kiwicopple and others added 3 commits February 6, 2023 21:57

Update apps/www/_blog/2023-02-03-openai-embeddings-postgres-vector.mdx

01719e2

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update apps/www/_blog/2023-02-03-openai-embeddings-postgres-vector.mdx

482e320

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update apps/www/_blog/2023-02-03-openai-embeddings-postgres-vector.mdx

b3baba3

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

vercel bot deployed to Preview – zone-www-dot-com February 6, 2023 21:02 View deployment

kiwicopple added 2 commits February 6, 2023 22:03

trying ignore blocks

0e99fa4

Merge branch 'feat/docs-gpt-poc' of github.com:supabase/supabase into…

937c5db

… feat/docs-gpt-poc

vercel bot had a problem deploying to Preview – zone-www-dot-com February 6, 2023 21:06 Failure

kiwicopple added 2 commits February 6, 2023 22:11

ignore mdx

90a52c4

Deploy

55ad0fd

vercel bot deployed to Preview – zone-www-dot-com February 6, 2023 21:19 View deployment

github-actions bot reviewed Feb 6, 2023

View reviewed changes

kiwicopple merged commit b0b3212 into master Feb 6, 2023

kiwicopple deleted the feat/docs-gpt-poc branch February 6, 2023 21:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: DocsGPT POC#12056

WIP: DocsGPT POC #12056

kiwicopple commented Jan 30, 2023 •
edited

vercel bot commented Jan 30, 2023 •
edited

github-actions bot Feb 6, 2023


		A new PostgreSQL extension is now available in Supabase: [`pgvector`](https://github.com/pgvector/pgvector), an open-source vector similarity search.

		The exponential progress of AI functionality over the past year has inspired many new real world applications. One specific challenge has been the ability to store and query _embeddings_ at scale.

WIP: DocsGPT POC#12056

WIP: DocsGPT POC #12056

Conversation

kiwicopple commented Jan 30, 2023 • edited

Pre-reqs

Config/secrets

Run local Supabase stack

Generate embeddings (first time only)

Run edge function

Run docs project

vercel bot commented Jan 30, 2023 • edited

github-actions bot Feb 6, 2023

Choose a reason for hiding this comment

kiwicopple commented Jan 30, 2023 •
edited

vercel bot commented Jan 30, 2023 •
edited