Gustavo Ribeiro dos Santos

Senior Data & Cloud Engineer | AI/MLOps Specialist
New York, US

About

Senior Data and Cloud Engineer with 10 years of experience, specializing in multi-cloud (AWS, GCP, Azure) data engineering, MLOps, and AI/LLM solutions. Proven leader in architecting scalable data lakes, developing machine learning models, and improving operational efficiency for organizations in the banking, healthcare, and technology sectors.

Work

Keeggo

FinOps Engineer

São Paulo, São Paulo, Brazil

Summary

Drove strategic value through cost optimization and compliance, leveraging expertise in data analysis, AI, and automation within multi-cloud environments.

Highlights

Identified and implemented measures that reduced cloud costs and ensured regulatory compliance across Azure, GCP, and AWS environments.

Managed sensitive data for a crucial Federal Government project, utilizing data analysis and AI to extract information, identify patterns, and optimize acquisition processes, enhancing transparency.

Combined strategic planning with hands-on technical execution to reduce expenses and improve resource utilization, supporting organizations' long-term financial and operational objectives.

CI&T

Senior Cloud Engineer, Google Cloud Specialist

New York, New York, US

Summary

Designed and built scalable, highly available, AI-first solutions on the organization's Google Cloud ecosystem as a Cloud and AI specialist.

Highlights

Developed complex machine learning models for banking, healthcare, and technology sectors, leveraging Retrieval-Augmented Generation (RAG) and Reinforcement Learning to deliver accurate, high-value results.

Engineered and implemented AI applications, including agents and multi-agent systems, using Google tools (Agentspace, Vertex AI, Gemini, Dialogflow) and open-source frameworks (CrewAI, LangChain).

Transformed raw data into intelligent, automated actions by integrating advanced AI functionalities into cloud solutions, enhancing operational efficiency and decision-making.

Cognizant

Data Engineer

São Paulo, São Paulo, Brazil

Summary

Led large-scale data processing on Google Cloud and Microsoft Azure, building ETL/ELT pipelines, designing Data Lake architectures, and promoting a data-driven culture.

Highlights

Developed and optimized ETL/ELT pipelines for large-scale batch and streaming data on Google Cloud and Microsoft Azure, ensuring efficient data extraction, transformation, and cleaning.

Designed and developed scalable Data Lake architectures and orchestrated data workflows with Apache Airflow, enhancing data availability and reliability.

Optimized complex SQL query performance in BigQuery and automated data processes with Alteryx, improving data retrieval efficiency by 15%.

Pioneered a data-driven culture within the LATAM data team, leading training programs for interns and new employees in data analysis, data science, data engineering, and Google Cloud.

Defined and executed the initial phase of an automated testing system using Generative AI (Vertex AI and Gemini), including monitoring and managing resources on Google Cloud.

Magna Sistemas

Senior Data Engineer

São Paulo, São Paulo, Brazil

Summary

Developed robust web applications and integrated complex APIs, ensuring high-quality delivery and database management for diverse projects.

Highlights

Developed web applications using JavaScript, Angular, and Liferay DXP, enhancing user experience and system functionality.

Integrated and managed APIs on InterSystems technologies (IRIS, Caché, Ensemble, COS, Zen, CSP) as well as REST APIs written in Python, Java 8+, and Spring Framework, improving system interoperability.

Managed and optimized databases including PostgreSQL, SQL Server, and Oracle, ensuring data integrity and high performance.

Implemented unit and integration tests and CI/CD pipelines, significantly improving software quality and delivery efficiency.

OSM

Software Engineer

Brasília, Distrito Federal, Brazil

Summary

Supported and developed applications for legal and healthcare systems, enhancing data processing and integration for large public agencies.

Highlights

Provided support, conducted code reviews, and developed applications for critical legal and healthcare systems using SQL, PL/SQL, Python, JavaScript, and InterSystems.

Developed ETL pipelines and performed data analysis and integration using the InterSystems IRIS Data Platform and the MDX (Multidimensional Expressions) query language, improving data flow and accessibility.

Maintained database systems including PostgreSQL, SQL Server, Redis, Oracle, and MongoDB, ensuring database stability and performance.

Banco do Brasil

Software Engineering Intern

Brasília, Distrito Federal, Brazil

Summary

Contributed to database development, data analysis, and automation, enhancing internal applications and reporting capabilities.

Highlights

Developed and managed databases using MySQL and IBM DB2, supporting critical banking operations.

Performed data analysis and statistics using SAS (Enterprise Guide and Data Explorer Management), providing insights for decision-making.

Created dashboards with Power BI and Spotfire, visualizing key performance indicators and improving data accessibility for management.

Wrote Python and Go scripts to automate routine tasks, streamlining operational processes and reducing manual effort.

Assisted in developing applications for the legal intranet using Java and Spring Framework, and enhanced web applications with Ionic, React, HTML, CSS, JavaScript, and jQuery.

Education

Thomas Edison State University
United States of America

Associate's degree

Architecting Cloud Solutions

Universidade Católica de Brasília
Brasília, Distrito Federal, Brazil

Bachelor's degree

Software Engineering

Awards

Airflow Champion

Awarded By

Astronomer

Awarded for significant contributions to the development and evolution of Apache Airflow technologies.

GDG Organizer

Awarded By

Google Developer Groups

Recognized for leadership in organizing Google Developer Groups, representing Google Cloud in Brazil.

Google Cloud AI Trusted Tester

Awarded By

Google

Granted access to pre-release Google Cloud AI technologies to provide feedback and corrections.

Microsoft Student Ambassador

Awarded By

Microsoft

Recognized for contributions and leadership within Microsoft student programs.

Microsoft MVP

Awarded By

Microsoft

Awarded for exceptional technical expertise and community contributions.

GitHub Program Member

Awarded By

GitHub

Recognized for active participation and contributions to the GitHub community and programs.

Languages

English
French
Spanish

Certificates

DP-900: Azure Data Fundamentals

Issued By

Microsoft

AZ-900: Azure Fundamentals

Issued By

Microsoft

CLF-C02: AWS Certified Cloud Practitioner

Issued By

AWS

Astronomer Certification for Apache Airflow Fundamentals

Issued By

Astronomer

Astronomer Certification DAG Authoring for Apache Airflow

Issued By

Astronomer

GitHub Foundations

Issued By

GitHub

IBM Cloud Essentials

Issued By

IBM

Google Cloud Generative AI

Issued By

Google Cloud

Google Cloud Responsible AI

Issued By

Google Cloud

Skills

Languages & Fundamentals

Python (data manipulation, code optimization), SQL (advanced), ETL/ELT, REST APIs.

Cloud & MLOps

Google Cloud Platform (GCP), Vertex AI, AWS, Microsoft Azure, Multi-cloud, DevOps, CI/CD, MLOps.

Data & Workflow

Data Engineering, Data Science, AI, Machine Learning, Gemini (Generative AI), Apache Airflow (Orchestration), Databricks, Alteryx, SAS.

Infrastructure & Automation

Automation, Optimization, FinOps, Cybersecurity.

Databases

SQL Server, MongoDB, IBM DB2, Cloud SQL, DynamoDB, Google BigQuery, Azure Data Lake Gen2, InterSystems IRIS/Caché, PostgreSQL, Oracle, MySQL, Redis.

Soft Skills

Leadership, Mentoring, Communication, Strategic Vision.

Projects

Google Developer Groups (GDG) Organizer

Summary

Led Google Developer Groups, a globally recognized Google initiative, representing Google Cloud in Brazil and engaging with the developer community.

International Speaker - The Developer's Conference, Campus Party, Build with AI, Google DevFest, Google I/O Extended

Summary

Spoke at numerous high-profile technology conferences and events across Brazil and Peru.

TDC (The Developer's Conference) Cloud & Generative AI Track Coordinator

Summary

Coordinated the Cloud and Generative AI tracks of The Developer's Conference in São Paulo and Brasília.

Ford Motor Company's First Data Lake (Latin America)

Summary

Contributed to the development of Ford Motor Company's first Data Lake in Latin America, helping establish foundational data infrastructure.

CBIE 2019 - Brazilian Congress on Informatics in Education Exhibitor

Summary

Participated as an exhibitor at the Brazilian Congress on Informatics in Education (CBIE 2019).

IBM Cloud Discovery & Bluehack Hackathon Participant

Summary

Participated in the IBM Cloud Discovery and IBM's Bluehack Hackathon in São Paulo.