March 2025 Meetup

The March BangPypers meetup, in collaboration with Google for Developers, provided valuable hands-on learning in the field of AI. Participants engaged in a workshop to develop a YouTube summarizer using Google Cloud and the google-genai library. The event also featured an insightful session on multi-modal models, offering practical knowledge and theoretical understanding.

Nikhil Rana expertly guided attendees through setting up their Google Cloud consoles and demonstrated the process of building GenAI-powered applications using Vertex AI. He then showcased a live YouTube summarizer application running on Google Cloud CodeRun.

Hand-on Workshop on Gemini

Following the workshop, Abhik Sarkar provided a comprehensive overview of multi-modal models, emphasizing their ability to integrate heterogeneous data sources, specifically text and images. He began by outlining the evolution from single-modality AI, highlighting deep learning’s advancements in individual text and image processing. He then detailed the core mechanisms: encoding modalities into a shared latent space using transformers, fusing these representations through cross-attention, and employing contrastive learning for optimized model training.

Understanding Multimodal Models: A Brief History and How They Work

The meetup concluded with a networking session and closing remarks thanking attendees from Abhilash and Prasad.

Abhilash Thanking Attendees

To stay updated with our future events and discussions: