• Python (NumPy, pandas, scikit-learn, seaborn, Matplotlib)
• R (dplyr, ggplot2, DESeq2, tidyverse)
• Perl (BioPerl)
• Bash
• Snakemake
• SQL (MySQL)
• Omics analysis (genomics, transcriptomics, proteomics, metagenomics)
• Differential expression
• Mapping
• Sequence analysis
• Alignment
• Next-generation sequencing (NGS) analysis
• ncRNA analysis
• Annotation
• Genome assembly
• Software and method development
• Statistical tests: methods for group comparisons (t-test, Mann–Whitney U test, ANOVA, Kruskal–Wallis test)
• Regression: linear and logistic regression, LASSO, Random Forest, Support Vector Regression (SVR)
• Classification: Random Forest, Support Vector Machines (SVM), k-nearest neighbors
• Clustering: k-means
• Dimensionality reduction: PCA
• Cluster Platforms (user experience): Slurm, Sun Grid Engine (SGE)
• Version Control: Git, GitHub
• Linux, Jupyter Notebook, LaTex, Markdown, conda
• English: Near-native (C2, Cambridge CPE)
• Portuguese: Native (C2)
• German: Advanced (C1)
• Spanish: Advanced (B2/C1)
• Scientific writing (publications and technical reports)
• Presentation of results to interdisciplinary audiences
• Independent and proactive working style
• Collaboration in cross-functional teams (wet-lab scientists, biologists, computer scientists, statisticians, clinicians)
• Collaboration in multicultural environments (Germany, Brazil, Netherlands, Mexico, Canada, India)
• Project management and student mentoring
• Clear communication and structured problem-solving
• Strong motivation to learn new topics and technologies