Skills

Programming

• Python (NumPy, pandas, scikit-learn, seaborn, Matplotlib)
• R (dplyr, ggplot2, DESeq2, tidyverse)
• Perl (BioPerl)
• Bash

Workflow & Automation

• Snakemake

Databases

• SQL (MySQL)

Bioinformatics

• Omics analysis (genomics, transcriptomics, proteomics, metagenomics)
• Differential expression
• Mapping
• Sequence analysis
• Alignment
• Next-generation sequencing (NGS) analysis
• ncRNA analysis
• Annotation
• Genome assembly
• Software and method development

Machine Learning

• Statistical tests: methods for group comparisons (t-test, Mann–Whitney U test, ANOVA, Kruskal–Wallis test)
• Regression: linear and logistic regression, LASSO, Random Forest, Support Vector Regression (SVR)
• Classification: Random Forest, Support Vector Machines (SVM), k-nearest neighbors
• Clustering: k-means
• Dimensionality reduction: PCA

Infrastructure

• Cluster Platforms (user experience): Slurm, Sun Grid Engine (SGE)
• Version Control: Git, GitHub

Other Tools

• Linux, Jupyter Notebook, LaTex, Markdown, conda

Languages

• English: Near-native (C2, Cambridge CPE)
• Portuguese: Native (C2)
• German: Advanced (C1)
• Spanish: Advanced (B2/C1)

Communication & collaboration

• Scientific writing (publications and technical reports)
• Presentation of results to interdisciplinary audiences
• Independent and proactive working style
• Collaboration in cross-functional teams (wet-lab scientists, biologists, computer scientists, statisticians, clinicians)
• Collaboration in multicultural environments (Germany, Brazil, Netherlands, Mexico, Canada, India)
• Project management and student mentoring
• Clear communication and structured problem-solving
• Strong motivation to learn new topics and technologies