Leveraging the Gemini 1.5 Multimodal Model (Generative AI) for Software Development

April 11, 2024

Image Source: https://dataedo.com/asset/img/banners/blog/cartoons.png

Google recently launched the Gemini 1.5 Pro model, a mid-sized multimodal model optimised for scaling across a wide range of tasks.
In this blog, we will learn how the Gemini 1.5 Pro model can help us during software development.

This blog is an updated version of my previous blog:

How Generative AI improves the productivity of Software developers

All examples in this blog use the freeform prompt in Google AI Studio with the Gemini 1.5 Pro model.

Below are some of the ways in which Gemini 1.5 Pro can help software developers:

1- Getting answers by letting the model skim through the content
Suppose you want quick answers to specific questions rather than spending time reading a document or watching a video. In that case, you can upload those files to Google AI Studio’s freeform prompt and ask your questions.
If you have multiple questions, you can use a chat prompt in Google AI Studio instead.

The example below shows how the model reads a PDF file to answer my query.

The example below shows how the model views a video file to answer my query.

2- Getting accessibility descriptions to use in the application

To make our application usable by a diverse audience, as per the W3C, it is necessary that we provide a content description for all the visual content in our application.
We can upload an image file to the model and ask it for an accessibility description of that content. This description can then be used in our application.
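
The examples in this blog use the AI Studio interface, but the same idea can be scripted. Below is a minimal sketch, not the method used above, of how an image and a prompt could be sent to the model over the Gemini REST API and how the returned description could be placed in an alt attribute. The endpoint URL, the model name `gemini-1.5-pro-latest`, the prompt wording, and all function names are assumptions made for illustration; check the current Gemini API documentation before relying on them.

```python
import base64
import html
import json
import urllib.request

# Assumed endpoint and model name; verify against the current Gemini API docs.
API_URL = (
    "https://generativelanguage.googleapis.com/v1beta/models/"
    "gemini-1.5-pro-latest:generateContent"
)

# Hypothetical prompt wording.
PROMPT = "Write a concise accessibility description (alt text) for this image."


def build_request(image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Build a request body with a text prompt and one base64-encoded inline image."""
    return {
        "contents": [{
            "parts": [
                {"text": PROMPT},
                {"inline_data": {
                    "mime_type": mime_type,
                    "data": base64.b64encode(image_bytes).decode("ascii"),
                }},
            ]
        }]
    }


def extract_text(response: dict) -> str:
    """Join the text parts of the first candidate in the response."""
    parts = response["candidates"][0]["content"]["parts"]
    return "".join(part.get("text", "") for part in parts).strip()


def describe_image(image_bytes: bytes, api_key: str) -> str:
    """Send the image to the model and return its description (needs network access)."""
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(build_request(image_bytes)).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
    )
    with urllib.request.urlopen(request) as resp:
        return extract_text(json.load(resp))


def img_tag(src: str, description: str) -> str:
    """Render an <img> element, escaping the description for use as alt text."""
    return (
        f'<img src="{html.escape(src, quote=True)}" '
        f'alt="{html.escape(description, quote=True)}">'
    )
```

With an API key, `img_tag("chart.png", describe_image(open("chart.png", "rb").read(), api_key))` would produce markup ready to paste into a page. A generated description is still worth reviewing by a person before it ships.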

3- Helps in understanding code

I wanted to understand the code in this GitHub repository and how an offline-first app is built, so I downloaded it and chose the “Folder upload” option in Google AI Studio. I then asked the model my questions and received the results below.
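
If you would rather script this than use the AI Studio interface, one rough approximation is to concatenate a project's source files into a single text prompt. The sketch below illustrates that idea only; it does not describe how the “Folder upload” option works internally. The function name, the extension list, and the size budget are hypothetical choices.

```python
from pathlib import Path

# Hypothetical defaults: adjust the extensions and size budget to your project.
SOURCE_EXTENSIONS = {".kt", ".java", ".py", ".md", ".gradle"}


def build_repo_prompt(root: str, question: str, max_chars: int = 200_000) -> str:
    """Concatenate source files under `root` into one prompt, followed by a question."""
    root_path = Path(root)
    sections = []
    used = 0
    for path in sorted(root_path.rglob("*")):
        # Skip directories and files that are not source or documentation.
        if not path.is_file() or path.suffix not in SOURCE_EXTENSIONS:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        if used + len(text) > max_chars:
            break  # stop before exceeding the size budget
        # Label each file with its relative path so the model can refer to it.
        rel = path.relative_to(root_path).as_posix()
        sections.append(f"--- {rel} ---\n{text}")
        used += len(text)
    return "\n\n".join(sections) + f"\n\nQuestion: {question}"
```

The resulting string can be pasted into a prompt or sent through an API call. Labelling each file with its path makes it easier for the model to say where a given behaviour is implemented.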

4- Helps in understanding UML diagrams

If you want a UML diagram explained so that you understand it better, just upload the image and type an input prompt, or start asking your questions about the diagram, and you will receive the answers.

5- Helps in code review

If you want suggestions on a piece of code, just type it in, or take a screenshot and upload the image to Google AI Studio, and ask the model for a code review. You may be surprised by the level of detail in the review, which also comes with an improved version of the code snippet.
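
When the code is typed in rather than uploaded as a screenshot, numbering the lines in the prompt can make the review easier to follow, since the model can point at specific lines. The helper below is a small sketch of such a prompt. The function name and the prompt wording are illustrative assumptions, not the prompt used for the example above.

```python
def build_review_prompt(code: str, language: str = "") -> str:
    """Number each line of `code` and wrap it in a code review request."""
    lines = code.strip("\n").splitlines()
    # Pad line numbers to the same width so the code stays aligned.
    width = len(str(len(lines)))
    numbered = "\n".join(
        f"{number:>{width}} | {line}" for number, line in enumerate(lines, 1)
    )
    label = f" {language}" if language else ""
    return (
        f"Review the following{label} code. List bugs, readability issues, and "
        "possible improvements, citing line numbers, then provide an improved "
        "version.\n\n"
        f"{numbered}"
    )
```

For example, `build_review_prompt("def add(a, b):\n    return a - b\n", "Python")` yields a prompt whose code section starts with `1 | def add(a, b):`.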

Conclusion

The new Gemini 1.5 Pro multimodal model can improve a software developer’s productivity by assisting with various software development tasks and helping them better understand code and software content.
Developers can create prompts for the above tasks, save them, and access them from the ‘My library’ tab in Google AI Studio: https://aistudio.google.com/app/library

To close, I would encourage software developers to start leveraging Google’s Generative AI suite across software development lifecycle tasks.

References

https://developers.googleblog.com/2024/02/gemini-15-available-for-private-preview-in-google-ai-studio.html

Leveraging Gemini 1.5 Multimodal model(Generative AI) for Software development was originally published in Google Developer Experts on Medium, where people are continuing the conversation by highlighting and responding to this story.
