ADRIEN OLIVA

Bioinformatics, data science & agentic AI for high-stakes, high-dimensional decisions.

Senior Bioinformatician / Data Scientist with a PhD in Computational Biology (2022) and 8+ years building production ML pipelines, statistical models, and cloud workflows from raw sequencing reads to decision-ready reports. Currently Lead Bioinformatician at Genics. Previously lead data scientist on $3.2M+ of NIH / UNDP / DFAT-funded research at CSIRO + Mayo Clinic, and quantitative analyst at AE Capital Hedge Fund. Builds agentic AI systems on the side.

Based: Brisbane, Australia
Mentored: 15+ team members
Papers: 6 first-author Q1
Quant experience: AE Capital Hedge Fund
LAST UPDATED MAY 2026
PHD·DS·AI
Research-grade methods, production-grade infrastructure, and an AI-first builder's mindset, in one stack.
Bioinformatics
NGS · Pangenome · Microbiome · Personalised medicine
Data Science
ML · Bayesian stats · Prediction · Time-series · ETL
Agentic AI
Multi-agent · Token optimization · Local LLMs · Automation
Quant edge
Time-series · Signal validation · Anomaly detection
2018 · PhD start → 2026 · today
Worked with research institutes, biotech, hedge funds, and clinical partners
CSIRO
Mayo Clinic
Genics
AE Capital Hedge Fund
Univ. Paris-Saclay
CNRS
About

A bioinformatician who ships data products, finds signal in noise, and deploys what works.

I work in the messy middle between research-grade methods and production infrastructure: the place where raw sequencing data, statistical assumptions, cloud systems, stakeholder needs, and deadlines all collide.

At Genics I lead bioinformatics strategy across commercial genomics: end-to-end Nanopore 16S pipelines for microbiome and pathogen detection, anomaly detection on noisy signals, automated client reporting, genomic selection databases, and the SOPs that keep all of it reproducible.

Before that I spent three years at CSIRO co-leading cloud genomics and personalised medicine programs with Mayo Clinic, Université Paris-Saclay, and partners in Indonesia and Korea. I was in charge of the FHIR + blockchain consent infrastructure (GeneGuardian) and built pharmacogenomics and variant detection workflows (PharmCAT, sBeacon, VariantSpark, DeepVariant) alongside pangenome graph methods. I was also lead data scientist on $3.2M+ of NIH, UNDP, and DFAT-funded research.

Quantitative finance is something I genuinely love. I had a first taste at AE Capital Hedge Fund validating ML signals on multi-asset time-series, and I would love another shot at it. In the meantime I keep that muscle alive through my agentic AI work, where one of the live use cases is algorithmic trading research. The pattern across genomics, pathogens, and markets is the same: high-dimensional noisy data, statistical rigour, and decisions that have to hold up under scrutiny.

On the side I build agentic AI infrastructure (OpenClaw + Hermes), orchestrating Claude, GPT, and local Ollama models to automate research, document workflows, and business operations.

Quick facts

Based in: Brisbane, Australia
Experience: 8+ years
Education: PhD completed 2022
Citizenship: French + Australian
Languages: EN · FR · ES (beg.)

Open to

Primary: Senior Bioinformatician / Computational Biologist / Data Scientist
Very interested in: Quant analyst / quant research
Role type: Senior hands-on or team lead
Locations: Australia · Europe · Remote
Industries: Health · Pharma · Personalised medicine · AgTech · Hedge funds
Selected Work

Production pipelines, applied ML, and quantitative signal work.

Genics · 2026 · Production

Nanopore 16S microbiome & pathogen pipeline

Architected, set up and deployed the company's full Oxford Nanopore analytical stack: live DNA sample to pathogen detection report. Covered basecalling, mapping, variant calling, pathway analysis, statistical testing, SQL integration, and automated client reporting. Validated and reliability-tested end to end.

Live DNA → pathogen report: architected · built · deployed · validated
Nanopore · 16S microbiome · Pathogen detection · Python · SQL · Pipeline architecture · QA & validation · Client reporting
CSIRO · 2024 · Deployed to customers

GeneGuardian: AI-assisted data governance

Co-led an AI-assisted dynamic consent platform for genomic data: FHIR integration, blockchain, decentralized storage (IPFS / Filecoin / Storj), and AWS + Terraform deployment. HIPAA / GDPR / GA4GH compliant. Now live and serving customers. Supervised 10+ students across IT, medical informatics, and bioinformatics.

geneguardian.online: live platform · 1st-author paper
FHIR · Blockchain · AWS · Terraform · IPFS · GA4GH · HIPAA · GDPR · Team supervision (10+) · Scientific writing
CSIRO · 2022–2025 · Deployed

Cloud genomics for international partners

Deployed and trained partners on end-to-end pipelines hosted in their own AWS accounts across Indonesia, Korea, Mayo Clinic, and Australian clinical sites, supporting pharmacogenomics, infectious disease surveillance (COVID-19, TB), and data sovereignty. Stack covered PharmCAT (pharmacogenomics), sVEP (variant effect prediction), VariantSpark (random-forest GWAS), and sBeacon (federated genomic data exchange).

4 countries · 4 stacks: deployed · trained · supported in production
AWS · Terraform · Pharmacogenomics · Variant calling · GWAS · Federated data · Cross-border deployment · Partner training · Stakeholder engagement
AE Capital · 2022 · Quant

ML signal validation on multi-asset time-series

Validated machine-learning signals and built statistical models on high-dimensional financial time-series across equity prices, bond valuations, and macro indicators. Feature engineering, anomaly detection, EDA, and risk management in a fast-paced hedge-fund environment.

Multi-asset: time-series · signals · risk management
Python · MATLAB · Time-series · ML signal validation · Risk management · Anomaly detection · Feature engineering · EDA · Statistical modelling
Genics · 2026 · Live

Commercial SNP array pipeline (new species)

Sole developer for a commercial-grade SNP array pipeline on a new tree species, leading the work from scoping and resource/price estimation through validation, delivery, and client-owned IP handover. Also built the relational SQL database (DbSchema) for downstream genomic selection.

New SNP array · new IP: scoped · built · validated · shipped
SNP array · R · SQL · DbSchema · Genomic selection · Project scoping · Resource estimation · QA & validation · Client delivery
Personal · 2026 · Building

OpenClaw + Hermes agentic AI framework

Open-source multi-agent orchestration framework routing tasks across Claude API (reasoning + code), OpenAI GPT (general-purpose), and local Ollama Qwen models (lightweight heartbeat agents). Live use cases: algorithmic trading research, personal AI assistant, automated webpage generation, business-development automation, plus research and bibliography workflows driven by genuine curiosity.

Multi-LLM: Claude · GPT · Ollama · token optimization
Agentic AI · Multi-LLM orchestration · Claude Skills · Local LLMs · Token optimization · Automation · Research & bibliography · Open source
CSIRO + Mayo Clinic · 2025 · Published

Pangenome for personalised medicine in underrepresented populations

Led a multi-institution collaboration with Mayo Clinic and Université Paris-Saclay applying pangenome graph methods to improve precision medicine for Somali and other underrepresented Horn of Africa / Middle Eastern populations. Directed 10+ contributors across IT, bioinformatics, and biostatistics. Quantified reference bias effects on PCA, D, and F-statistics; ran the full biostatistics analysis on HPC.

10+ directed: multi-institution · 1st-author paper
Pangenome · Personalised medicine · Population genetics · Biostatistics · HPC · PCA · D · F-stats · Team leadership (10+) · Multi-institution
PhD · 2021–2025 · Q1 papers

Pangenome methods for reference bias

Benchmarked ancient-DNA alignment methods, quantified reference bias effects on PCA, D, and F-statistics, and built pangenome graph workflows that materially improved analyses for underrepresented populations. Trained Google DeepVariant (deep-learning variant caller) on aDNA-specific damage patterns.

3 × Q1: Briefings in Bioinformatics · Ecology & Evolution · BioMolecules
Pangenome · DeepVariant · BWA · HPC · Population genetics · Method benchmarking · Scientific writing
Experience

Bioinformatics foundation, data science delivery.

Jan 2026 – Present (current)

Lead Bioinformatician

Genics Pty Ltd · Brisbane, Australia

Leading bioinformatics strategy and end-to-end pipeline development for a commercial genomics company serving agricultural and industrial clients.

  • Architected the company's full Oxford Nanopore 16S microbiome stack, from live DNA sample to pathogen detection report.
  • Sole developer of a commercial SNP-array pipeline for a new tree species: scope, resource estimation, validation, delivery, IP handover.
  • Designed the genomic selection SQL database (DbSchema) and authored SOPs to centralise reproducibility, QA, and onboarding.
  • Translating complex genomic data into reports and content for non-technical stakeholders (farmers, producers, C-suite).
  • Authoring peer-reviewed publications, industry magazine articles, and blog posts to grow scientific engagement.
Pipeline architecture · Nanopore · Microbiome · Pathogen detection · SNP array · SQL · DbSchema · SOPs · QA · Stakeholder comms · Scientific writing · Team leadership
Oct 2022 – Oct 2025

Cloud Genomic Data Scientist

CSIRO · Remote, Australia

Led and delivered 4+ concurrent national and international programs spanning cloud genomics, AI-assisted bioinformatics, and federated learning.

  • Co-led GeneGuardian, a FHIR + blockchain + decentralized storage consent platform on AWS, now deployed to customers at geneguardian.online. 1st author in GigaScience (Q1) · DOI 10.1093/gigascience/giae021.
  • Led the multi-institution pangenome project with Mayo Clinic + Université Paris-Saclay on reference bias in underrepresented populations, supervising postdocs and students throughout. 1st author in BioMolecules (Q1) · DOI 10.3390/biom15040582.
  • Deployed pharmacogenomics (PharmCAT), variant effect prediction (sVEP), GWAS (VariantSpark) and federated data exchange (sBeacon) pipelines to partner AWS accounts in Indonesia, Korea, Mayo Clinic, and Australian clinical sites (see blog post).
  • Represented CSIRO in federated-learning consortia with Microsoft, MIT, NVIDIA, and the South Australian government.
  • Mentored 15+ team members across IT, medical, and bioinformatics disciplines. Winner of the CSIRO Health & Biosecurity Entrepreneurship Award (2024).
AWS · Terraform · FHIR · Blockchain · IPFS · GA4GH · HIPAA · GDPR · Pharmacogenomics · GWAS · Pangenome · Federated learning · Team leadership (10+) · 15+ mentored · Grant management · Scientific writing
Mar 2022 – Aug 2022

Quantitative Analyst

AE Capital Hedge Fund · Melbourne, Australia

Validated ML signals and modelled high-dimensional financial time-series across equities, bonds, and macro indicators in a fast-paced hedge-fund environment.

  • Built predictive models for bid / ask price movement and anomaly detection on multi-asset time-series.
  • Applied statistical modelling, feature engineering, and ML signal validation for pattern recognition and risk management.
  • Conducted EDA on noisy, high-dimensional datasets to support systematic trading research and strategy development.
Time-series modelling · Predictive models · Bid / ask spread · Anomaly detection · ML signal validation · Risk management · Feature engineering · EDA · Python · MATLAB · Cross-functional collab
Aug 2018 – Apr 2022

PhD in Bioinformatics

University of Adelaide

Thesis: "Quantifying and Reducing Biases in Genomic Research Using Pangenome." Published three Q1 first-author papers and co-authored FAIR/CARE community standards for reproducible aDNA research.

Pangenome graphs · aDNA · DeepVariant · Population genetics · D / F-statistics · HPC · SLURM · Method benchmarking · Reproducibility (FAIR/CARE) · Teaching · Scientific writing
2013 – 2018

Earlier research & software roles

CNRS · ANU · University of Auckland
  • Research Software Developer · CNRS · 2017–2018: developed and published the MPEE ancestral sequence reconstruction model in PhyML (Bioinformatics, Oxford, Q1).
  • Research Intern · ANU · 2016: implemented MCMC phylogeographic methods in the R package Phyloland, applying Bayesian inference to migration simulation.
  • Web Developer Intern · 2014: built features and managed the SQL database for BasiCompta, a financial-tracking web app for sports associations (JavaScript, jQuery, PHP).
  • Research Intern · University of Auckland · 2013: built a gene database for endometriosis research with the ENCODE consortium, annotating SNPs via the UCSC Genome Browser.
Phylogenetics · Bayesian stats · MCMC · C · R · Python · SQL · Web dev · Genome annotation
Publications, talks & writing

Selected papers, conference talks, and blog posts.

Selected publications · 6 first-author Q1

  • BioMolecules · Q1 · 2025
    A pangenomic approach to improve population genetics analysis and reference bias in underrepresented Middle Eastern and Horn of Africa populations
    Oliva A. et al. · 1st author
  • Q1 journal · 2025
    Lessons learned: recommendations for reproducible paleogenomic data analyses
    Oliva A., Souilmi Y. et al. · 1st author
  • GigaScience · Q1 · 2024
    Future-proofing genomic data and consent management: a comprehensive review of technology innovations
    Oliva A. et al. · 1st author · DOI 10.1093/gigascience/giae021
  • Briefings in Bioinformatics · Q1 · 2021
    Systematic benchmark of ancient DNA read mapping
    Oliva A. et al. · 1st author · IF 9.5 · DOI 10.1093/bib/bbab076
  • Ecology & Evolution · Q1 · 2021
    Additional evaluations show that specific BWA-aln settings still outperform BWA-mem for ancient DNA data alignment
    Oliva A., Tobler R., Llamas B., Souilmi Y. · 1st author · DOI 10.1002/ece3.8297
  • Bioinformatics (Oxford) · Q1 · 2019
    Accounting for ambiguity in ancestral sequence reconstruction
    Oliva A. et al. · 1st author

Selected talks · 2020 – 2024

  • ABACBS · 2024 · AI Beyond OMICS: Navigating Data Governance Challenges · Talk + Workshop
  • ABACBS · 2024 · GeneGuardian: Secure consent and data management · Workshop
  • EY · 2024 · AI Impact on Digital Health in Australia · Industry
  • GA4GH · 2024 · Precision Medicine for Coronary Heart Disease in Somali Population · Poster
  • APIdays · 2023 · The Power of APIs in Genomic Research · Talk
  • CNRS · 2023 · Cloud in Genomics & Personalised Medicine · Invited
  • ABACBS · 2020 · Systematic Benchmark of aDNA Mapping Bias · Talk

Blog posts · CSIRO Bioinformatics

Skills

Core stack for bioinformatics, applied ML, and quantitative analysis.

Languages

  • Python: daily · 8+ yrs
  • R: daily · 8+ yrs
  • SQL: daily
  • Bash / shell: daily
  • C: PhD + CNRS
  • MATLAB: AE Capital
  • JavaScript / PHP: project-level

ML & modelling

  • Biostatistics: expert
  • Variant detection: expert
  • Anomaly detection: expert
  • Time-series: advanced
  • Bayesian / MCMC: advanced
  • Random forest / GBM: advanced
  • Deep learning (DeepVariant): advanced
  • scikit-learn · PyTorch: applied

Domain

  • Genomics · NGS · WGS: expert
  • Pangenome graphs: expert
  • Variant calling: expert
  • Microbiome (16S, Nanopore): production
  • Pharmacogenomics: advanced
  • Population genetics · GWAS: advanced
  • Pathogen detection: production

Cloud & eng.

  • AWS (EC2 · S3 · Lambda): daily
  • HPC / SLURM: advanced
  • Git · CI/CD: daily
  • Docker · Singularity: daily
  • Terraform · IaC: production
  • Snakemake / Nextflow: production
  • FHIR · GA4GH: advanced

Leadership

  • Team lead (3–10): 3+ yrs
  • Student supervision: 15+ mentored
  • Agile · Scrum · Jira: daily
  • Stakeholder comms: C-suite ↔ farmers
  • Grant / project mgmt: $3.2M+ delivered
  • SOPs · reproducibility: authored
  • Teaching: Bioinformatics 101

Data & governance

  • SQL · DbSchema: daily
  • HIPAA · GDPR: production
  • GA4GH standards: advanced
  • Consent · federated learning: applied
  • IPFS · Filecoin · Storj: deployed
  • Pandas · Polars: daily

Quant edge

  • Multi-asset time-series: AE Capital
  • ML signal validation: applied
  • Feature engineering: advanced
  • EDA on noisy data: native
  • Risk & statistical testing: advanced
  • Pattern recognition: applied

AI / LLM

  • Claude API · OpenAI: daily
  • Local LLMs (Ollama · Qwen): production
  • Agentic systems: built framework
  • LLM-assisted coding: native
  • Multi-agent orchestration: OpenClaw + Hermes
  • Workflow automation: shipped
Beyond the day job

Agentic AI, quant passion, and personal projects.

Quant finance

A passion I keep alive in code.

Quantitative finance has been a long-standing passion. I had a first taste at AE Capital Hedge Fund and would love another shot at it. In the meantime I keep that muscle alive by merging it with my agentic AI work: algorithmic trading pipelines, signal validation, and anomaly detection on market data are some of the live use cases of the OpenClaw framework.

Time-series · Signals · Algo trading
Personal edge

Side hustles, sport, and learning loops.

Outside work, I like staying active and curious: volunteering as a basketball coach, paragliding when I can, building casual coding prototypes, playing video games, and gathering around a good board game. It is less about polishing a side-hustle brand and more about keeping learning loops alive.

Basketball coaching · Paragliding · Coding prototypes · Video games · Board games

Let's talk about what you're building.

Best fits: senior hands-on or tech-lead roles in bioinformatics, computational biology, or data science. Also genuinely keen on quantitative analyst or quant research positions. Open to Australia, Europe, or remote. Response within 48 hours.

adrioliva@hotmail.fr