Maria Rivera Araya

Cloud data specialist and educator

Biography

I am an cloud data specialist who loves using technology to improve science. I blend the latest cloud data ingestion, orchestration and analysis tools to optimize scientific workflows.

I use data management, statistical models and geospatial tools to process complex and large environmental data analysing and uncovering hidden patterns in data. I am very passionate about teaching and applying reproducible research and the use of open source programming languages to analyse and visualise data.

Interests

Data engineering
Reproducible and Open Science
Data Science
Software engineering best practices for data scientists
Environmental history
Science outreach

Education

PhD Natural and Physical Sciences (Geochemistry), 2021

James Cook University, Australia
Master in Geography (Postgraduate Fulbright Scholar), 2017

University of Georgia, United States
B.S. in Chemistry, 2014

Universidad de Costa Rica, Costa Rica
B.A. in Anthropology, 2013

Universidad de Costa Rica, Costa Rica

Skills

Cloud tools and infrastructure

Azure Data Factory, Synapse Analytics, Azure Blob Storage, Azure Machine Learning, Azure Databricks, Git, GitHub

Programming Languages

Python and R to perform supervised and unsupervised data mining techniques. Bayesian modelling and Soil and Water Assessment Tool (SWAT). Use of decision trees, random forest, K neighbors and XG Boost to predict environmental indicator

Big data tools

Apache Spark and Databricks

Data management

Reproducible and open science using RMarkdown, Jupyter Notebooks and Git. Application of the FAIR Data Principles and use of Relational Databases (SQL)

Teaching and communication

Cloud fundamental concepts and hands on tutorials. Data science subjects and workshops: The Carpentries R for Reproducible Scientific Analysis, Statistical Comparisons, Data Visualisation, Data Mining and Foundations of Data Science. Sample of teaching slides

Geospatial analysis

Geoprocessing, spatial statistics, remote sensing using Python

Teaching cloud and data skills

Cloud Fundamentals for environmental scientists

Department of Environment, Science and Innovation Dec 2023

Used The Carpentries Workbench to build a curriculum to onboard scientists in the use of cloud tools

Tutorials for geospatial analysis in the cloud

Department of Environment, Science and Innovation Jun 2023

Certified Instructor

Queensland Infrastructure Foundation/The Carpentries Jul 2019

Instructor and tutor Master of Data Science

James Cook University Jun 2019 – Jul 2021

Delivered tutorials and lectures and graded assessments in several subjects, including data visualisation, foundations of data science, statistical methods for data scientists and introduction to data mining.

Projects

End-to-end scientific cloud pipelines for reproducible science

Machine learning models to predict sediment types

Past climate and environmental changes in Northern Australia

Using organic remains to model unknown ages in sediments

Using elemental abundance, metals and algae to reconstruct hydrology

Monitoring drought using satellite images in Costa Rica (NASA)

Monitoring mangrove health using remote sensing in India (NASA)

Featured Awards

Service Award Best Indigenous Peer Assisted Learning Advisor

Indigenous Education and Research Centre, James Cook University Apr 2019

This award recognises the outstanding contributions of learning advisors to the advancement of Indigenous and Torres Strait Islanders students in higher education

World Fellowship

Delta Kappa Gamma Society International for Key Women Educators Aug 2015

Awarded to promote professional and personal growth during my studies at the University of Georgia

Fulbright Foreign Student Program Scholarship

United States Government Jan 2015 – May 2017

The program is one of the most prestigious and impactful academic exchange programs in the world. Awarded to study my masters in Geography at the University of Georgia