Software

2 minute read

Master LLM Hallucinations 💭

December 15, 2023

Building with AI and LLMs is now a must-know for every developer. Every application is trying to integrate AI models. But hallucinations — the phenomenon of AI models generating incorrect or unverified information — are still an unsolved problem.

“Ughh ChatGPT – I told you to NOT make stuff up!”

Andrej Karpathy shared recently his take on hallucinations on Twitter:

“The LLM has no “hallucination problem”. Hallucination is not a bug, it is LLM’s greatest feature. The LLM Assistant has a hallucination problem, and we should fix it.”

So how do we fix it?

Chain-of-Verification (CoVe), a technique introduced by researchers at Meta, is one way. Let’s dive into the high-level process of CoVe and then explore how we implemented a CoVe prompt template using AIConfig that you can use to reduce hallucinations in your LLM-powered apps.

The Chain-of-Verification (CoVe) Process 🔗

As documented in the white paper, the process involves 4 crucial steps:

1️⃣ Generate Baseline: Given a query, the Large Language Model (LLM) generates a response.
2️⃣ Plan Verification(s): With the query and baseline response, the system formulates a list of verification questions. These would aid in analyzing potential inaccuracies within the original response.
3️⃣ Execute Verification(s): Each verification question is answered, and then cross-checked against the original response to discern inconsistencies or flaws.
4️⃣ Generate Final Response: If inconsistencies are found, a revised response is generated, factoring in the results from the verification process.

Integrate CoVe into your App with AIConfig💡

We’ve brought the CoVe technique to life using AIConfig, streamlining the process to help reduce hallucinations in your LLM applications.

Using AIConfig, we can separate the core application logic from the model components (prompts, model routing parameters, etc.). Here’s what the prompt template looks like:

1️⃣ GPT4 + Baseline Generation prompt: This sets the foundation by generating the initial response using GPT4.
2️⃣ GPT4 + Verification prompt: This prompt creates a series of verification questions based on the initial response.
3️⃣ GPT4 + Final Response Generation prompt: Leveraging the findings from the verification stage, this prompt generates a final, more reliable response.

🔗 AIConfig CoVE: https://github.com/lastmile-ai/aiconfig/tree/main/cookbooks/Chain-of-Verification

Want to see it action? 👀 Try out our demo in Streamlit!!

🎁 Streamlit App: https://chain-of-verification.streamlit.app/

Are you already using AIConfig or CoVe in your projects? Feel free to share your experiences in the comments below.

Liked the post?

Show your support by starring our project on GitHub! ⭐️ https://github.com/lastmile-ai/aiconfig

How to Use Idea Screening for New Feature and Product Development

December 15, 2023

Software

Milvus Adventures Dec 15, 2023

December 16, 2023

M	T	W	T	F	S	S
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Hand-Picked Top-Read Stories

Data Privacy and Vibe Coding

Lynxjs Extension Pack

AI/ML OPS-Learning Road Map

Trending Tags

Master LLM Hallucinations 💭

The Chain-of-Verification (CoVe) Process 🔗

Integrate CoVe into your App with AIConfig💡

Leave a Reply Cancel reply

Previous Post

How to Use Idea Screening for New Feature and Product Development

Next Post

Milvus Adventures Dec 15, 2023

Master LLM Hallucinations 💭

The Chain-of-Verification (CoVe) Process 🔗

Integrate CoVe into your App with AIConfig💡

Leave a Reply Cancel reply

Previous Post

Next Post

Related Posts