Leveraging Gemini 1.5 Multimodal model(Generative AI) for Software development

leveraging-gemini-1.5-multimodal-model(generative-ai)-for-software-development
Image Source: https://dataedo.com/asset/img/banners/blog/cartoons.png

Google recently launched Gemini 1.5 Pro model, which is a mid-sized multimodal model optimised for scaling across wide range of tasks.
In this blog, we will learn how Gemini 1.5 Pro model can help us during software development.

This blog is an improved and recent version of my previousย blog,

How Generative AI improves the productivity of Software developers

All examples in this blog use the freeform prompt in Google AI Studio and Gemini 1.5 proย model.

Below are some of the ways in which Gemini 1.5 Pro can help software developers,

1- Getting the answers by skimming through the content
Suppose you want to know answers to specific questions quickly rather than spending time in reading or viewing that text or video content. In that case, you can upload those files to Google AI Studioโ€™s freeform prompt and ask your questions.
In case if you have multiple questions, you can use a chat prompt in Google AIย studio.

Below example shows how the model reads the pdf file for me to get the answer to myย query.

Below example shows how the model views the video file to get me the answer to myย query.

2- Getting the Accessibility descriptions to use in the application

To enable our application to be used by diverse audience, as per W3C, its necessary that we provide a content description for all the visual content in our application.
We can get this by uploading the image file to the model to get the accessibility description for that content. This accessibility description can then be used in our applications.

3- Helps in understanding theย code

I wanted to understand the code in this Github repository and how an offline-first app is built, so I downloaded it and choose โ€œFolder uploadโ€ option in Google AI studio, post that I asked the model my queries and received belowย results.

4- Helps in understand UMLย Diagrams

If there are certain UML Diagrams that youโ€™d want to get explanation for, to get a better understanding of the diagram, just upload the image and type an input prompt or start asking your queries about the UML Diagram and youโ€™d receive theย answers.

5- Helps in Codeย review

If youโ€™d want to receive suggestions on a piece of code, just type it or take a screenshot of it and upload the image to Google AI studio and ask the model for a code review and youโ€™d be surprised by the detailing in the review given as well as an improved version of the code snippet is provided.

Conclusion

The new Gemini 1.5 pro multimodal model can improve a software developerโ€™s productivity by assisting them in various software development tasks and empowering them to get a better understanding of the code and software content.
They can create prompts for the above tasks and save and access them from the Google AI Studio from the โ€˜My libraryโ€™ tabโ€Šโ€”โ€Šhttps://aistudio.google.com/app/library

At the end of this blog post, I would encourage software developers to start leveraging Googleโ€™s Generative AI suite in various software development lifecycle tasks.

References

https://developers.googleblog.com/2024/02/gemini-15-available-for-private-preview-in-google-ai-studio.html


Leveraging Gemini 1.5 Multimodal model(Generative AI) for Software development was originally published in Google Developer Experts on Medium, where people are continuing the conversation by highlighting and responding to this story.

Total
0
Shares
Leave a Reply

Your email address will not be published. Required fields are marked *

Previous Post
fine-tuning-gemma-2b-to-solve-math-problems

Fine Tuning Gemma-2b to Solve Math Problems

Next Post
how-to-set-up-a-content-publishing-process-worthy-of-repetition

How To Set Up a Content Publishing Process Worthy of Repetition

Related Posts