Skip to content

Conversation

vishal-dharm
Copy link
Collaborator

Description of the change

This PR fixes the "argument list too long" error in the text_generation.sh file when using large base64 encoded images by using temporary files to store the encoded data and the JSON payload. This resolves issues encountered when running the text_gen_multimodal_one_image_prompt and text_gen_multimodal_one_image_prompt_streaming examples.

This PR also makes the following additions and updates:

  • Adds two new multimodal vision examples to text_generation.sh:

    • text_gen_multimodal_two_image_prompt: Demonstrates using two images in a single prompt.
    • text_gen_multimodal_one_image_bounding_box_prompt: Shows how to generate bounding boxes for objects.
  • Updates the text_gen_multimodal_video_prompt example: The prompt is now more comprehensive and does a better job demonstrating Gemini 1.5's multimodal capabilities.

Motivation

These examples are being added so they can be included in the revamped vision documentation.

Type of change

Feature request

Checklist

  • I have performed a self-review of my code.
  • I have added detailed comments to my code where applicable.
  • I have verified that my change does not break existing code.
  • My PR is based on the latest changes of the main branch (if unsure, please run git pull --rebase upstream main).
  • I am familiar with the Google Style Guide for the language I have coded in.
  • I have read through the Contributing Guide and signed the Contributor License Agreement.

@vishal-dharm vishal-dharm added the status:awaiting review PR awaiting review from a maintainer label Nov 16, 2024
@github-actions github-actions bot added the component:python sdk Issue/PR related to Python SDK label Nov 16, 2024
@MarkDaoust
Copy link
Collaborator

Thanks!

@MarkDaoust MarkDaoust merged commit a04fcd1 into main Nov 16, 2024
12 checks passed
@github-actions github-actions bot removed the status:awaiting review PR awaiting review from a maintainer label Nov 16, 2024
@vishal-dharm vishal-dharm deleted the rest-vision-examples branch November 16, 2024 18:52
Annhiluc pushed a commit to Annhiluc/generative-ai-python that referenced this pull request Nov 22, 2024
MarkDaoust pushed a commit that referenced this pull request Nov 26, 2024
* fix: Pass along model_version in GenerateContentResponse.

* Revert autogenerated doc files from 94eb16e.

* Fix 'argument list too long' error and add couple vision examples (#634)

* Update google-ai-generativelanguage version in requirements.

* Format updated generation_types and test using black.

---------

Co-authored-by: Vishal Dharmadhikari <61256217+vishal-dharm@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component:python sdk Issue/PR related to Python SDK
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants