8
Certifications
5+
Projects
2
Live deployments
PT
Based in Portugal
01
Ingest
REST APIs
CSV
Batch I/O
02
Transform
PySpark
Pandas · NumPy
SQL
03
Orchestrate
Apache Airflow
GitHub Actions
Logging
04
ML-Ready
Feature Eng.
Model Eval
Prediction Svc
05
Deploy
Render
AWS Cognito
WAF
Customer Churn Pipeline
Video Demo
ML-integrated ETL for churn prediction. Deployed on Render with AWS Cognito auth and WAF security layer.
Python AWS ML Render
NLP Sentiment Pipeline
Live
Text processing pipeline over product reviews driving a Streamlit dashboard with real-time filtering by category and time.
NLP Streamlit Neural nets
Workforce SQL Analysis
Complete
Diagnostic analysis of employee data covering compensation equity, diversity metrics, and workforce stability insights.
SQL DataCamp Analytics
Customer Churn Pipeline
Video Demo
Designed and implemented an end-to-end machine learning pipeline for customer churn prediction, covering data ingestion, preprocessing, feature engineering, model training, and deployment. Built reusable ETL components for data cleaning, missing-value handling, feature binning, and statistical aggregation, then integrated model evaluation and profit-based threshold optimization to improve business decision-making. Deployed the system on Render with Amazon Web Services services including S3, Cognito, and WAF for storage, authentication, and security, creating a reliable and scalable production-ready prediction workflow.
Python ETL AWS ML Render
NLP Sentiment Analysis Pipeline
Live
Built a text processing pipeline to clean, transform, and analyze large-scale product review data. Developed a neural network for sentiment classification and integrated it into an interactive Streamlit dashboard with dynamic filtering by product, category, and time. The project enables real-time exploration of review trends and demonstrates a production-ready workflow that combines scalable text processing with machine learning–driven business insights.
NLP Streamlit Neural nets Python
Data Engineering & ETL
ETL Pipeline Design Data Transformation Data Cleaning Batch Processing Schema Design Data Modeling Hive-style Partitioning
Programming & Processing
Python SQL PySpark Pandas NumPy pytest
Orchestration & Monitoring
Apache Airflow DAG Design Structured Logging Pipeline Debugging GitHub Actions (CI/CD)
Cloud & Deployment
AWS S3 AWS Cognito AWS WAF Render Git
ML Integration & Analysis
Supervised Learning Feature Engineering Model Evaluation ML-Ready Pipelines EDA Matplotlib Seaborn
  • Critical Thinking & Problem Solving
  • Analytical Mindset & Data-Driven Thinking
  • Attention to Detail
  • Curiosity & Learning Agility
  • Ability to Present Results & Insights
  • Persistence & Self-Discipline
Data Engineer Professional
DataCamp
View PDF
AI Engineer for Data Scientists Associate
DataCamp
View PDF
Data Scientist Professional
DataCamp
View PDF
Data Analyst
DataCamp
View PDF
Python Data Associate
DataCamp
View PDF
SQL Associate
DataCamp
View PDF
AI Fundamentals
DataCamp
View PDF
Data Literacy
DataCamp
View PDF

Education

Bachelor's in Informatics Engineering
ESTG-IPVC — Instituto Politécnico de Viana do Castelo

Location

Based in Portugal. Open to remote positions and hybrid in the north of portugal opportunities.

Looking for

Junior Data Engineer role to contribute to data infrastructure and grow in distributed systems and orchestration.

I am a Data Engineer with a background in Informatics Engineering, focused on building scalable data pipelines and production-ready data systems.

I have hands-on experience designing ETL workflows, transforming large datasets, and preparing data for machine learning applications. I have built end-to-end pipelines covering data ingestion, transformation, modeling, and deployment.

Recently, I completed the Data Engineer Professional Certification, working with tools such as Airflow and logging systems, strengthening my understanding of workflow orchestration and pipeline monitoring.

I am particularly interested in building reliable, scalable, and ML-ready data systems, bridging Data Engineering and Machine Learning.

Currently seeking a junior Data Engineer role to contribute to data infrastructure and grow in distributed systems and orchestration.