feat(api): reranking backend integrated in with rag #1090

CollectiveUnicorn · 2024-09-20T22:21:49Z

Description

Adds reranking to RAG pipeline
Adds RAG configuration endpoint when in dev mode
Adds additional logging
Refactors the pytest's test_routes api tests
Alters default RAG values into two steps, retrieval and ranking. With the retrieval results being set to 100 after ranking the results are filtered down to the user specified k value. If reranking is not enabled, the user specified k results is returned from the retrieval step.
Adds Zarf configs to enable dev mode.

FlashRank Evals

Rerank (ms-marco-TinyBERT-L-2-v2) + 100 top-k retrieval

Final Results:
INFO:root:Average Needle in a Haystack (NIAH) Retrieval: 1.0
INFO:root:Average Needle in a Haystack (NIAH) Response: 1.0
INFO:root:Average Correctness (GEval): 0.812
INFO:root:Average Answer Relevancy: 0.9590000000000001
INFO:root:Average Annotation Relevancy: 0.92
INFO:root:MMLU: 0.697979797979798
INFO:root:HumanEval: 0.96
INFO:root:Eval Execution Runtime (seconds): 3122.107043981552

Rerank (ms-marco-TinyBERT-L-2-v2) + 10 top-k retrieval

Final Results:
INFO:root:Average Needle in a Haystack (NIAH) Retrieval: 1.0
INFO:root:Average Needle in a Haystack (NIAH) Response: 1.0
INFO:root:Average Correctness (GEval): 0.8140000000000001
INFO:root:Average Answer Relevancy: 0.9470000000000001
INFO:root:Average Annotation Relevancy: 0.92
INFO:root:MMLU: 0.697979797979798
INFO:root:HumanEval: 0.96
INFO:root:Eval Execution Runtime (seconds): 3119.72075009346

No Rerank

Final Results:
INFO:root:Average Needle in a Haystack (NIAH) Retrieval: 1.0
INFO:root:Average Needle in a Haystack (NIAH) Response: 1.0
INFO:root:Average Correctness (GEval): 0.8299999999999998
INFO:root:Average Answer Relevancy: 0.9555128205128205
INFO:root:Average Annotation Relevancy: 0.92
INFO:root:MMLU: 0.697979797979798
INFO:root:HumanEval: 0.96
INFO:root:Eval Execution Runtime (seconds): 3112.184923171997

Baseline - No Rerank

Final Results:
INFO:root:Average Needle in a Haystack (NIAH) Retrieval: 1.0
INFO:root:Average Needle in a Haystack (NIAH) Response: 1.0
INFO:root:Average Correctness (GEval): 0.81
INFO:root:Average Answer Relevancy: 0.9616666666666666
INFO:root:Average Annotation Relevancy: 0.92
INFO:root:MMLU: 0.695959595959596
INFO:root:HumanEval: 0.96
INFO:root:Eval Execution Runtime (seconds): 1961.5159051418304

Checklist before merging

Tests, documentation, ADR added or updated as needed
Followed the Contributor Guide Steps

netlify · 2024-09-20T22:22:06Z

✅ Deploy Preview for leapfrogai-docs canceled.

Name	Link
🔨 Latest commit	`01e77df`
🔍 Latest deploy log	https://app.netlify.com/sites/leapfrogai-docs/deploys/66fc411a06886c000814d563

… down

jalling97 · 2024-09-27T19:30:36Z

The Annotation Relevancy metric is what I was initially concerned with when I saw the results. I would have expected it to change (in either direction) after reranking was implemented. However, from what I can tell, the top_k chunks (with or without reranking) correspond to the same documents either way, resulting in no practical difference in annotations. Since the Annotation Relevancy metric isn't concerned with the order of the documents (just that the ones you expect are in the list of annotations), the results are pretty much unchanged.

This is good to note, as the next stage of evals will incorporate the chunk data thanks to #1164, so we can better evaluate the rank of the chunks themselves.

Overall I think this is a net gain and we'll have to see with the next wave of evals what practical difference the reranker has.

…th-rag

tests/utils/client.py

tests/integration/api/test_rag_files.py

CollectiveUnicorn added 2 commits September 20, 2024 14:52

Initial reranking setup

b63f085

Naive reranking implemented with query

09fd6c3

CollectiveUnicorn linked an issue Sep 20, 2024 that may be closed by this pull request

feat: reranking backend integrated in with RAG #1089

Closed

3 tasks

Adds endpoint to check current rag configuraiton

f14e22d

CollectiveUnicorn added the enhancement New feature or request label Sep 20, 2024

Fixes typo

5a9bb34

CollectiveUnicorn self-assigned this Sep 20, 2024

CollectiveUnicorn added 22 commits September 23, 2024 11:47

Adds route to fast api router

10163ac

Fixes issue in endpoint configs

a82b596

Ensures that the class level variable has a default value

6b6eb82

Makes the enable_reranking var a classvar

7337762

Creates separate response type

88cdbca

Additional comments

75c42f8

Cleans up comments and uses correct class for post requests

329a296

Adds the model config to the search endpoint so that it can be passed…

d19be30

… down

Adds output to evaluate reranking, refactors class

be75d01

Adds more output

ebe977c

More logging

2e85b5e

Refactors logging and adds additional outputs

40a8061

Updates the similarity measure after reranking

d74ba5f

Adds additional logging

8ad5216

Improves readability of logging

4634ed0

Simply debug output for further readability

5e4837a

Change user prompt to system prompt

9e051d7

Replaces custom reranker with library and llm with FlashRank

e8316c1

Fixes invalid dictionary index

b355e86

Adds more ranking models and configuration for ranking models

77b7249

Adds score and rank to search item response

75e70fd

Ensures that the configured model is used when ranking

2db8263

CollectiveUnicorn added 6 commits September 27, 2024 11:23

Make deep copy to prevent issues with variables overwriting

0872a5e

Fixes update logic

0fb97b8

Moves the update function to the payload class

b86fdf3

Prevents default overwriting

825df2a

Prevents default overwriting

cec2a26

Adds to configuration test

588b3ec

CollectiveUnicorn requested review from justinthelaw and jalling97 September 27, 2024 19:31

jalling97 previously approved these changes Sep 30, 2024

View reviewed changes

Merge branch 'main' into 1089-feat-reranking-backend-integrated-in-wi…

e55e68f

…th-rag

CollectiveUnicorn dismissed jalling97’s stale review via e55e68f September 30, 2024 16:04

CollectiveUnicorn and others added 5 commits September 30, 2024 09:05

Update test_rag_files.py

f8e6d20

Adds small fixes

5a23a51

Merge branch 'main' into 1089-feat-reranking-backend-integrated-in-wi…

0fb0353

…th-rag

Ruff linting

0db333e

Merge branch 'main' into 1089-feat-reranking-backend-integrated-in-wi…

d87e247

…th-rag

justinthelaw requested changes Oct 1, 2024

View reviewed changes

tests/utils/client.py Outdated Show resolved Hide resolved

tests/utils/client.py Outdated Show resolved Hide resolved

tests/utils/client.py Outdated Show resolved Hide resolved

Fix unnecessary environment variables

2cbe4bd

jalling97 previously approved these changes Oct 1, 2024

View reviewed changes

justinthelaw reviewed Oct 1, 2024

View reviewed changes

tests/integration/api/test_rag_files.py Outdated Show resolved Hide resolved

Swaps env out with new helper function

01e77df

CollectiveUnicorn dismissed jalling97’s stale review via 01e77df October 1, 2024 18:36

CollectiveUnicorn requested review from jalling97 and justinthelaw October 1, 2024 18:36

justinthelaw approved these changes Oct 1, 2024

View reviewed changes

jalling97 approved these changes Oct 1, 2024

View reviewed changes

CollectiveUnicorn merged commit 2f80d87 into main Oct 1, 2024
29 checks passed

CollectiveUnicorn deleted the 1089-feat-reranking-backend-integrated-in-with-rag branch October 1, 2024 19:34

github-actions bot mentioned this pull request Oct 1, 2024

chore(main): release 0.14.0 #1160

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(api): reranking backend integrated in with rag #1090

feat(api): reranking backend integrated in with rag #1090

CollectiveUnicorn commented Sep 20, 2024 •

edited

Loading

netlify bot commented Sep 20, 2024 •

edited

Loading

jalling97 commented Sep 27, 2024 •

edited by gphorvath

Loading

feat(api): reranking backend integrated in with rag #1090

feat(api): reranking backend integrated in with rag #1090

Conversation

CollectiveUnicorn commented Sep 20, 2024 • edited Loading

Description

FlashRank Evals

Checklist before merging

netlify bot commented Sep 20, 2024 • edited Loading

✅ Deploy Preview for leapfrogai-docs canceled.

jalling97 commented Sep 27, 2024 • edited by gphorvath Loading

CollectiveUnicorn commented Sep 20, 2024 •

edited

Loading

netlify bot commented Sep 20, 2024 •

edited

Loading

jalling97 commented Sep 27, 2024 •

edited by gphorvath

Loading