Skip to content

Ingest API data via Langchain, embed your API data into a private Chroma DB hosted on AWS, and chat with your data via OpenAI

Notifications You must be signed in to change notification settings

arndvs/gpt4-langchain-ingest-api-data-private-chroma-aws

 
 

Repository files navigation

GPT-4, LangChain, Private Chroma DB Deployed to AWS, Ingesting Data Via API

Tech stack used includes LangChain, Private Chroma DB Deployed to AWS, Typescript, Openai, and Next.js. LangChain is a framework that makes it easier to build scalable AI/LLM apps and chatbots. Chroma is an opensource vectorstore for storing embeddings and your API data.

How to Deploy Private Chroma Vector DB to AWS video

Development

  1. Clone the repo or download the ZIP
git clone [github https url]
  1. Install packages

First run npm install yarn -g to install yarn globally (if you haven't already).

Then run:

yarn install

After installation, you should now see a node_modules folder.

  1. Set up your .env file
  • Copy .env.example into .env Your .env file should look like this:
OPENAI_API_KEY=

CHROMA_AWS_API_TOKEN=
CHROMA_AWS_API_GATEWAY_URL=

CHROMA_AXIOS_API_URL=
CHROMA_AXIOS_API_TOKEN=
CHROMA_COLLECTION_NAME=

  • Visit openai to retrieve API keys and insert into your .env file.
  • Visit Chroma

Credit

Frontend of this repo is inspired by langchain-chat-nextjs Large portions of this repo is inspired by gpt4-pdf-chatbot-langchain How to Deploy Private Chroma Vector DB to AWS Deploy a PRIVATE Chroma Vector DB to AWS | Step by step tutorial | Part 2

About

Ingest API data via Langchain, embed your API data into a private Chroma DB hosted on AWS, and chat with your data via OpenAI

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 86.5%
  • CSS 12.2%
  • JavaScript 1.3%