Improving GPT's instruction following ability of non english prompts #440

Raghavan1988 · 2023-12-26T08:27:26Z

GPT 3.5's response quality in non english prompt is usually poorer compared to english atleast for south asian and middle eastern languages due to lesser representation in training data.

In this tutorial, i showed a simple translation trick improves the response quality anecdotally.

Showing GPT 3.5's gap in non english prompts and simple technique on how to improve response quality

Raghavan1988 · 2023-12-26T22:49:16Z

requesting review @ezzcodeezzlife @OlesiaZinchenko

DonGuillotine

Hello @Raghavan1988,

This is a really interesting tutorial, I'm sure people from non English backgrounds will benefit a lot from this.

I have a few suggestions to make your tutorial even more effective and compliant with the guidelines:

You have effectively used one # header and several ## H2 headings, Kindly add a specific H2 heading for "Topic Input" and another one to summarize all previously discussed points.
You've provided code snippets showing how to implement the translation layer. To make the tutorial more beginner-friendly, I suggest expanding the code implementation section. Specifically, it would be helpful to include a more complete example of how GPT-3.5 processes the translated prompt.
A possible enhancement could be a full implementation example that includes:
- A function for translating non-English prompts to English.
- Processing the translated prompt with GPT-3.5.
- Translating the response back to the original language.
- A simple example demonstrating the entire flow from input to output.

These suggestions will make your tutorial more accessible to beginners but also give them a practical tool to experiment with.

Thank you for your contribution to the community 🚀

Raghavan1988 · 2024-02-05T17:15:28Z

Thanks @DonGuillotine for the review. I made changes to create 3 functions

- Updated Header Structure - Corrected Markdown format for URLs with a custom text label

DonGuillotine · 2024-02-05T20:47:03Z

Thanks @DonGuillotine for the review. I made changes to create 3 functions

Hello @Raghavan1988,

It's great to see the updates you have made 🎉, I made some updates to the Header and Hyperlinks, kindly note it and apply to future tutorials.

Here are the final finishing touches to this great tutorial:

Remove the flask installation as it was not used
Use the latest version of langchain and openai. For example in the latest version of langchain from langchain.llms import OpenAI is no longer supported, rather use: from langchain_community.chat_models import ChatOpenAI
When I tested your code, especially the translate_prompt_from_language_x_to_english function I got errors suggesting the library fails to parse the response it receives from the Google Translate API. The documentation clearly states this:

Due to limitations of the web version of google translate, this API does not guarantee that the library would work properly at all times. (so please use this library if you don’t care about stability.)

Please consider using an alternative library that provides similar functionality such as translate or pydeepl

Looking forward to the updates, great work so far @Raghavan1988 🥇

Raghavan1988

Thanks @DonGuillotine for very thoughtful review.
I made the following changes.

I removed langchain and used only OpenAI API since latest langchain APIs are different.
I removed googletrans and used "translate" package as per your review.
Added the end to end flask code with pointers to working github repository that i tested

Apologies for the delay in addressing your review. Please let me know if it looks good for merge or any further changes needed.

Create improve_response_quality_in_non_english_languages.mdx

fd1a33f

Showing GPT 3.5's gap in non english prompts and simple technique on how to improve response quality

DonGuillotine self-assigned this Jan 28, 2024

DonGuillotine requested changes Jan 28, 2024

View reviewed changes

Update improve_response_quality_in_non_english_languages.mdx

51b5ace

Update improve_response_quality_in_non_english_languages.mdx

4e8f938

- Updated Header Structure - Corrected Markdown format for URLs with a custom text label

Raghavan1988 added 5 commits April 8, 2024 23:32

Update improve_response_quality_in_non_english_languages.mdx

649fd01

Update improve_response_quality_in_non_english_languages.mdx

8f0cdb1

Update improve_response_quality_in_non_english_languages.mdx

03d8bcc

Update improve_response_quality_in_non_english_languages.mdx

d6826bd

Update improve_response_quality_in_non_english_languages.mdx

5dbff9d

Raghavan1988 commented Apr 9, 2024

View reviewed changes

Raghavan1988 added 2 commits April 9, 2024 09:33

Update improve_response_quality_in_non_english_languages.mdx

e5a080d

Update improve_response_quality_in_non_english_languages.mdx

f49cdcc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improving GPT's instruction following ability of non english prompts #440

Improving GPT's instruction following ability of non english prompts #440

Raghavan1988 commented Dec 26, 2023

Raghavan1988 commented Dec 26, 2023

DonGuillotine left a comment

Raghavan1988 commented Feb 5, 2024

DonGuillotine commented Feb 5, 2024 •

edited

Loading

Raghavan1988 left a comment •

edited

Loading

Improving GPT's instruction following ability of non english prompts #440

Are you sure you want to change the base?

Improving GPT's instruction following ability of non english prompts #440

Conversation

Raghavan1988 commented Dec 26, 2023

Raghavan1988 commented Dec 26, 2023

DonGuillotine left a comment

Choose a reason for hiding this comment

Raghavan1988 commented Feb 5, 2024

DonGuillotine commented Feb 5, 2024 • edited Loading

Raghavan1988 left a comment • edited Loading

Choose a reason for hiding this comment

DonGuillotine commented Feb 5, 2024 •

edited

Loading

Raghavan1988 left a comment •

edited

Loading