feat: add client side retry to GeminiTextGenerator #1242

GarrettWu · 2024-12-26T21:41:41Z

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

b/385221339

jiaxunwu · 2024-12-27T21:05:51Z

bigframes/ml/llm.py

@@ -945,6 +946,7 @@ def predict(
        top_k: int = 40,
        top_p: float = 1.0,
        ground_with_google_search: bool = False,
+        retry: int = 0,


Can we rename it to max_retries?

jiaxunwu · 2024-12-27T21:08:44Z

bigframes/ml/llm.py


-        if (df[_ML_GENERATE_TEXT_STATUS] != "").any():
+            df_succ = df[df[_ML_GENERATE_TEXT_STATUS].str.len() == 0]
+            df_fail = df[df[_ML_GENERATE_TEXT_STATUS].str.len() > 0]


Can we just use df_succ to tell whether there is any failed row? Then it can reduce one filter op.

not really. We need df_fail for next retry, and append df_succ to df_result of current round. And I don't think DF subtract will be more effecient.

shobsi · 2024-12-27T22:44:47Z

bigframes/ml/llm.py

@@ -983,6 +985,10 @@ def predict(
                page for details: https://cloud.google.com/vertex-ai/generative-ai/pricing#google_models
                The default is `False`.

+            max_retries (int, default 0):
+                Max number of retry rounds if any rows failed in the prediction. Each round need to make progress (has succeeded rows) to continue the next retry round.


[nit] suggest rewording

Suggested change

Max number of retry rounds if any rows failed in the prediction. Each round need to make progress (has succeeded rows) to continue the next retry round.

Max number of retries if the prediction for any rows failed. Each try needs to make progress (i.e. has successfully predicted rows) to continue the retry.

shobsi · 2024-12-27T22:45:03Z

bigframes/ml/llm.py

@@ -983,6 +985,10 @@ def predict(
                page for details: https://cloud.google.com/vertex-ai/generative-ai/pricing#google_models
                The default is `False`.

+            max_retries (int, default 0):
+                Max number of retry rounds if any rows failed in the prediction. Each round need to make progress (has succeeded rows) to continue the next retry round.
+                Each round will append newly succeeded rows. When the max retry rounds is reached, the remaining failed rows will be appended to the end of the result.


[nit] suggest rewording

Suggested change

Each round will append newly succeeded rows. When the max retry rounds is reached, the remaining failed rows will be appended to the end of the result.

Each retry will append newly succeeded rows. When the max retries are reached, the remaining rows (the ones without successful predictions) will be appended to the end of the result.

shobsi · 2024-12-27T22:57:35Z

bigframes/ml/llm.py


-        if (df[_ML_GENERATE_TEXT_STATUS] != "").any():
+            df_succ = df[df[_ML_GENERATE_TEXT_STATUS].str.len() == 0]


for readability

success = df[_ML_GENERATE_TEXT_STATUS].str.len() == 0 df_succ = df[success] df_fail = df[~success]

shobsi · 2024-12-27T23:17:10Z

bigframes/ml/llm.py

+                break
+
+            df_result = (
+                bpd.concat([df_result, df_succ]) if not df_result.empty else df_succ


Isn't the if-else redundant here? bpd.concat() should handle if any of the input dfs is empty.

It gave me error in tests, sth related to multi-index

shobsi · 2024-12-27T23:17:41Z

bigframes/ml/llm.py

+            df_fail = df[df[_ML_GENERATE_TEXT_STATUS].str.len() > 0]
+
+            if df_succ.empty:
+                warnings.warn("Can't make any progress, stop retrying.", RuntimeWarning)


Looks like this warning will surface even if the user didn't wish for any retries (the default behavior), something which is not desirable

make sense.

GarrettWu added 4 commits December 26, 2024 21:18

feat: add client side retry to GeminiTextGenerator

2232d3a

test

5d60d06

test

36830b3

test

5af23e2

GarrettWu requested review from shobsi and jiaxunwu December 26, 2024 21:41

GarrettWu self-assigned this Dec 26, 2024

GarrettWu requested review from a team as code owners December 26, 2024 21:41

product-auto-label bot added size: m Pull request size is medium. api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. labels Dec 26, 2024

test

dad8eb8

product-auto-label bot added size: l Pull request size is large. and removed size: m Pull request size is medium. labels Dec 26, 2024

fix

6233332

jiaxunwu approved these changes Dec 27, 2024

View reviewed changes

GarrettWu added 3 commits December 27, 2024 21:40

max_retries

ed9918a

fix

b0fd896

fix

e928974

GarrettWu enabled auto-merge (squash) December 27, 2024 21:47

GarrettWu merged commit 8193abe into main Dec 27, 2024
22 checks passed

GarrettWu deleted the garrettwu-retry branch December 27, 2024 22:43

release-please bot mentioned this pull request Dec 27, 2024

chore(main): release 1.30.0 #1215

Merged

shobsi reviewed Dec 28, 2024

View reviewed changes

This was referenced Dec 31, 2024

chore: fix wordings of Gemini max_retries #1244

Merged

feat: add max_retries to TextEmbeddingGenerator and Claude3TextGenerator #1259

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add client side retry to GeminiTextGenerator #1242

feat: add client side retry to GeminiTextGenerator #1242

Uh oh!

GarrettWu commented Dec 26, 2024

Uh oh!

jiaxunwu Dec 27, 2024

Uh oh!

GarrettWu Dec 27, 2024

Uh oh!

jiaxunwu Dec 27, 2024

Uh oh!

GarrettWu Dec 27, 2024

Uh oh!

Uh oh!

shobsi Dec 27, 2024

Uh oh!

shobsi Dec 27, 2024

Uh oh!

shobsi Dec 27, 2024

Uh oh!

shobsi Dec 27, 2024

Uh oh!

GarrettWu Dec 30, 2024

Uh oh!

shobsi Dec 27, 2024

Uh oh!

GarrettWu Dec 30, 2024

Uh oh!

Uh oh!

	Max number of retry rounds if any rows failed in the prediction. Each round need to make progress (has succeeded rows) to continue the next retry round.
	Max number of retries if the prediction for any rows failed. Each try needs to make progress (i.e. has successfully predicted rows) to continue the retry.

	Each round will append newly succeeded rows. When the max retry rounds is reached, the remaining failed rows will be appended to the end of the result.
	Each retry will append newly succeeded rows. When the max retries are reached, the remaining rows (the ones without successful predictions) will be appended to the end of the result.


		if (df[_ML_GENERATE_TEXT_STATUS] != "").any():
		df_succ = df[df[_ML_GENERATE_TEXT_STATUS].str.len() == 0]

feat: add client side retry to GeminiTextGenerator #1242

feat: add client side retry to GeminiTextGenerator #1242

Uh oh!

Conversation

GarrettWu commented Dec 26, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!