Mauricio Arancibia
Mauricio Arancibia

Mauricio Arancibia

Data Scientist | AI Engineer | ML Specialist

๐Ÿ“ง necrus.aikon@gmail.com | ๐Ÿ“ž (+591) 75761340 | ๐Ÿ“ Sucre, Bolivia ๐ŸŒ www.neuraldojo.org | LinkedIn | GitHub | Kaggle | Medium

Professional Summary

Innovative System Engineer with over a decade of expertise in Data Science, Machine Learning, and Artificial Intelligence. Specialized in leveraging cutting-edge technologies to drive business impact through data-driven insights and advanced AI solutions.
  • ๐Ÿง  Expert in Data Science, Machine Learning, Deep Learning, and Generative AI
  • ๐Ÿ’ก Proven track record in Natural Language Processing and innovative AI projects
  • ๐Ÿค Skilled in collaborative environments, combining analytical prowess with attention to detail
  • ๐Ÿš€ Passionate about pushing the boundaries of AI technology to solve complex business challenges
notion image
ย 

Professional Experience

NTTData | Generative AI

September 2024 - Present| USA (Remote)
Leading innovative AI initiatives and driving strategic, data-driven decision-making:
โ€ข ๐Ÿ† Generative AI Tech Lead: Led the design and development of cutting-edge Proof of Concepts (PoCs), Minimum Viable Products (MVPs), and strategic proposals leveraging Generative AI to drive business transformation.
โ€ข ๐Ÿ“Š Data Science Chapter Leader: Directed cross-functional teams in delivering advanced analytics, machine learning solutions, and AI-driven insights, fostering a data-centric culture within the organization.
โ€ข ๐Ÿค– Generative AI Innovation: Spearheaded initiatives using Generative AI for real-world applications, such as automated content generation, conversational agents, and personalized recommendations, revolutionizing business processes and improving user experiences.
Key Technologies: Python, PySpark, AWS (SageMaker, Bedrock), Azure (Data Lake, Synapse, OpenAI, AI Services), LangChain, LlamaIndex, Vector Databases, Retrieval-Augmented Generation (RAG), AI Agents, Large Language Models (LLMs)

NTTData | Data Scientist/Data Analyst

July 2022 - August 2024| USA (Remote)
Advanced Data Analytics and Data Governance:
โ€ข ๐Ÿ” ETL & EDA Analytics: Executed efficient ETL processes and exploratory data analysis to extract actionable insights
โ€ข ๐Ÿ“Š Data Governance & Data Quality: Ensured data integrity and consistency across various business operations
โ€ข ๐Ÿง  NLP Projects: Implemented Natural Language Processing solutions to enhance business processes and automation
โ€ข ๐Ÿš€ Generative AI PoCs: Led Proof of Concepts for Retrieval-Augmented Generation (RAG) systems
Key Technologies: PowerBI, Python (Anaconda), PySpark, Azure Synapse, , Jira, Miro, Plotly, Matplotlib, Seaborn, Generative AI, Collibra, Datalake, Azure ML, AutoML, NLP, Keras, Pytorch, Tensorflow

UNICEF Bolivia | Data Analyst Consultant

November 2022 - February 2023 | Bolivia (Remote)
Developed data visualization solutions for UNICEF Bolivia:
โ€ข ๐Ÿ“Š PowerBI Dashboard Development: Created comprehensive dashboards to visualize and analyze key data for UNICEF Bolivia's programs and initiatives
โ€ข ๐Ÿ” Data Analysis: Conducted in-depth analysis of program data to provide actionable insights for decision-making
โ€ข ๐Ÿ“ˆ Performance Metrics: Designed and implemented KPI tracking systems to monitor program effectiveness
โ€ข ๐Ÿค Stakeholder Collaboration: Worked closely with UNICEF Bolivia team to ensure dashboards met their specific needs and requirements
Key Technologies: PowerBI, Excel, SQL, Data Modeling, DAX

FANCESA | Senior Data Manager

Present | Sucre, Bolivia
Revolutionizing data management and business intelligence in the cement industry:
  • ๐Ÿ“ˆ Business Intelligence and PowerBI Implementation: Led company-wide adoption of PowerBI, developing critical dashboards for production, quality control, and planning
  • ๐Ÿ’ฐ Cost Production Analytics: Created comprehensive dashboards for monitoring cement production costs across the entire supply chain
  • ๐Ÿ—๏ธ Green Field Cement Plant Project: Managed data integration and developed ETL processes for a new plant construction project
Key Technologies: PowerBI, Qlik, PostgreSQL, Python, Sharepoint

NTTData for UNICEF's Projects | Data Analyst

December 2021 - June 2022 | New York, USA
Leveraging data analytics to support global social initiatives:
  • ๐Ÿ“‘ Semantic Similarity Analysis: Implemented NLP techniques to analyze UNICEF project documents, enhancing project alignment and efficiency
  • โœˆ๏ธ Travel Analytics Dashboard: Developed PowerBI dashboard for optimizing staff travel, identifying key insights for cost and time management
  • ๐Ÿ—ฃ๏ธ Sentiment Analysis: Enhanced progress report analysis using advanced NLP techniques
Key Technologies: PowerBI, Python, NLP (NLTK, SpaCy, Gensim, Transformers), Keras+Tensorflow

Education

  • ๐ŸŽ“ Master in Advanced & Applied Artificial Intelligence - Universitat de Valencia (2021-2022)
  • ๐Ÿ“Š Data Scientist Nanodegree - Udacity (2021-2022)
  • ๐Ÿ’ผ Business Intelligence Specialization - Universidad Tecnolรณgica de Buenos Aires (2014)
  • ๐Ÿ–ฅ๏ธ Free Software University Master - Universitat Oberta de Catalunya (2009-2010)
  • ๐Ÿ”ง System Engineering - Universidad de San Francisco Xavier de Chuquisaca (2001-2005)

Certifications

  • โ˜๏ธย AWS Certified AI Practitioner - AWS (September 2024)
    • Earners of this badge understand AI, ML, and generative AI concepts, methods, and strategies in general and on AWS. They can determine the correct types of AI/ML technologies to apply to specific use cases and know how to use AI, ML, and generative AI technologies responsibly. They are familiar with the AWS Global Infrastructure, core AWS services and use cases, AWS service pricing models, and the AWS shared responsibility model for security and compliance in the AWS Cloud.
  • โ˜๏ธ Microsoft Certified: Azure AI Fundamentals - Microsoft (June 2024)
    • Credential ID: E2334D8F28BFD0E2
    • Demonstrates proficiency in Azure AI services, including machine learning, computer vision, natural language processing, and conversational AI
  • ๐Ÿค– Machine Learning - Coursera, Stanford Online (July 2021)
    • Comprehensive course covering supervised and unsupervised learning, best practices in machine learning and AI innovation
  • ๐Ÿ“Š Applied Data Science with Python Specialization - Coursera, University of Michigan (April 2021)
    • Series of courses covering data manipulation, visualization, machine learning, text mining, and social network analysis in Python
  • ๐Ÿ“ˆ Reproducible Research - Coursera, Johns Hopkins University (August 2016)
    • Focused on modern reproducible research data concepts and tools
  • ๐Ÿ“‰ Exploratory Data Analysis - Coursera, Johns Hopkins University (April 2016)
    • Techniques for summarizing data and creating analytic graphics in R
  • ๐Ÿ” Getting and Cleaning Data - Coursera, Johns Hopkins University (March 2015)
    • Methods for obtaining, cleaning, and managing data from various sources
  • ๐Ÿ”ฎ Pattern Discovery in Data Mining - Coursera, University of Illinois (March 2015)
    • Concepts and methodologies in data mining, focusing on pattern discovery and analysis
  • ๐Ÿ“Š R Programming - Coursera, Johns Hopkins University (March 2015)
    • Comprehensive course on R programming for data analysis
  • โš™๏ธ Process Mining: Data Science in Action - Coursera (January 2015)
    • Techniques for process discovery, conformance checking, and process analysis

Skills

ย 
Category
Skills
๐Ÿ“Š Data Visualization
PowerBI, Qlik, Tableau, Matplotlib, Plotly, Seaborn
๐Ÿ’ป Programming
Python, R, SQL, Java, REST API
๐Ÿค– Machine Learning
TensorFlow, Keras, PyTorch, Scikit-Learn, Azure ML, Azure Sagemaker
โ˜๏ธ Cloud & Big Data
AWS Bedrock, Azure AI Services, Azure Open AI, AWS AI Services
๐Ÿ—ฃ๏ธ NLP
NLTK, spaCy, Gensim, Transformers
๐Ÿง  Generative AI
LangChain, LlamaIndex, RAG, Fine-Tuning, Agents, CrewAI, LangGraph, Closed & Open Sources LLMs
๐Ÿ› ๏ธ MLOps/DevOps
Git, Docker, CI/CD pipelines
๐ŸŽจ UI/UX
Streamlit, Gradio
Databases
Azure SQL, PostgreSQL, Vector Databases

Notable Projects & Achievements

  • ๐Ÿง  Neural Dojo Founder: Created a community blog focused on Data Science, ML, and AI, fostering knowledge sharing and ethical AI applications
  • ๐Ÿ† Coursera Specializations: Completed multiple specializations including Machine Learning (Stanford) and Applied Data Science with Python (University of Michigan)
  • ๐Ÿš€ Open Source Contributions: Developed and shared multiple ML and AI projects on GitHub, including sentiment analysis, facial expression recognition, and weather classification models

Languages

  • ๐Ÿ‡ช๐Ÿ‡ธ Spanish (Native)
  • ๐Ÿ‡ฌ๐Ÿ‡ง English (Professional Working Proficiency)

ย 
ย 
ย 
ย 
Built with Potion.so