Manny (Mamiya) Adachi
A recent graduate of the Master of Health Data Science (Extension) at UNSW; formerly a business consultant in the healthcare, hotel, and retail industries and an ICT business analyst at a medical device company.
LinkedIn: https://www.linkedin.com/in/mamiyaad/
This page showcases the data science projects and tasks I have worked on, including code repositories and figures from selected works. For engagements under confidentiality restrictions, I describe the overviews and processes on a best-effort basis.
Tip: Each project/task is tagged with keywords. Please search for the keywords of your interest using "Ctrl + F" (e.g., statistical modeling/analysis, machine learning, visualization, data processing, image analysis).
Forecasting all-cause mortality: leveraging cause-of-death data through neural networks
[Keyword] Public Health, Supervised Machine Learning, Convolutional Neural Network (CNN), Lee-Carter Model, Dimensionality Reduction, Hyperparameter Tuning, Data Processing, Data Manipulation, Data Management, Exploratory Data Analysis, Data Visualization, Data Interpretation, Python

Winning SAS Institute's analytics competition and earning the SAS Viya skill certificate
[Keyword] Non-Profit Organization, 1st place winner, Internship, Supervised Machine Learning, Data Processing, Exploratory Data Analysis, Data Visualization, SAS Enterprise Miner

Investigating women's employment status tendencies among Canadian couples and families
[Keyword] Socioeconomics, Statistical Modeling/Analysis, Generalized Linear Model (GLM), Exploratory Data Analysis, Data Visualization, R Programming

Investigating the impact of media attention on contraceptive dispensing in Australia
[Keyword] Public Health, Interrupted Time Series, ARIMA, Exploratory Data Analysis, Data Visualization, R Programming

Predicting the hospital readmission of diabetic patients
[Keyword] Hospital Operation, Supervised Machine Learning, Random Forest, Logistic Regression, Machine Learning Pipeline, Feature Transformation, Hyperparameter Tuning, Data Visualization, Python

Developing a decision support algorithm for early-stage Parkinson's disease screening
[Keyword] Health Analytics, Decision Support, Supervised Machine Learning, Logistic Regression, Random Forest, Gradient Boosting Machine (GBM), Artificial Neural Network (ANN), Ensemble Models, Machine Learning Pipeline, Feature Transformation, Hyperparameter Tuning, Data Visualization, Python

Distinguishing medical images (blood cells infected or uninfected by malaria)
[Keyword] Medical Image Analysis, Unsupervised Machine Learning, Autoencoder, Neural Network, Data Visualization, Python

Distinguishing electroencephalogram data of alcoholic and non-alcoholic subjects
[Keyword] Health Analytics, Unsupervised Machine Learning, Autoencoder, Long Short-Term Memory (LSTM), Neural Network, Time-Series Data, Data Visualization, Python

Developing a decision support algorithm for hypotensive patient management in the ICU
[Keyword] Health Analytics, Decision Support, Reinforcement Learning, Unsupervised Machine Learning, Batch-Constrained Q-Learning (BCQL), K-Means Clustering, Time-Series Data, Data Preprocessing, Data Visualization, Python

Extracting specified information from legacy UNSW datasets
[Keyword] PostgreSQL, SQL, Data Extraction, Data Retrieval

Mapping emergency departments (EDs) and visualizing distances from the nearest ED
[Keyword] Public Health, Spatial Data, Data Visualization, Shiny App, R Programming

Developing Microsoft Excel-based analysis tools (business setting)
[Keyword] Data Manipulation, Data Visualization, Master Data Management, Sales Forecast, Managerial Accounting, MS Excel

Developing and implementing a BI dashboard (business setting)
[Keyword] BI Dashboard, Process Automation, Data Visualization, SAP BusinessObjects 4.0 Web Intelligence

Data entry, administration, and migration (business setting)
[Keyword] Data Entry, Data Administration, Data Migration, Data Warehouse, MS Excel

Consulting a small company by delivering actionable insights from numbers (business setting)
[Keyword] Retail Business, Small Business Consulting, Financial Simulation, Managerial Accounting, Business Planning, Data Visualization, MS Excel, MS PowerPoint, MS Word
[Keyword]
Public Health, Supervised Machine Learning, Convolutional Neural Network (CNN), Lee-Carter Model, Dimensionality Reduction, Hyperparameter Tuning, Data Processing, Data Manipulation, Data Management, Exploratory Data Analysis, Data Visualization, Data Interpretation, Python
[Overview]
My master's thesis developed a new CNN-based mortality forecasting model that integrates cause-of-death information, implemented in Python. The dataset was U.S. mortality data from 1959 to 2019, initially comprising 6 million+ records.
(Repository: https://github.com/MannyAdc/ForecastModel_LC_ML)
[Approach]
• Preprocessed/cleaned the large, fragmented raw datasets (6 million+ records in total), tracking and recording the process to enhance the reproducibility of the project output
  ◦ Documented the cleaning process visually with flowcharts, covering data format and duplication checks and the deletion and aggregation of rows and columns
  ◦ Made a data dictionary of the final dataset stating each variable's description, data type, and range/options
  ◦ Assessed data quality by key categories during exploratory data analysis (EDA) and visualization
  ◦ Omitted or kept abnormalities by iterating the above, to avoid potential "noise" while training the model
Sample Images of Documenting Data Preprocessing/Cleaning, Data Dictionary, and EDA
• Developed the new CNN model and other models for comparison; models known for weaker performance were omitted beforehand based on findings from past research (a minimal sketch of the CNN follows this list)
• Assessed model performance with a comparison table of training time and mean squared error (MSE), plus forecast-vs.-actual graphs visualizing each model's performance per age group
Comparison Table
Sample Visualization of the Forecast (Dotted Line) vs. Actual (Solid Line)
• Identified directions for future studies by noting what would need to be added beyond my thesis's scope
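Since the repository contains the full implementation, here is only a minimal Keras sketch of the core idea: a CNN that reads a window of past mortality rates, with causes of death as input channels, and forecasts next-year rates per age group. The dimensions, architecture, and dummy data are illustrative assumptions, not the thesis's exact configuration.

```python
# Minimal sketch (not the thesis code): a CNN mapping a window of past
# mortality rates, with causes of death as channels, to next-year rates.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

N_AGES, WINDOW, N_CAUSES = 100, 10, 8  # assumed dimensions

inputs = tf.keras.Input(shape=(N_AGES, WINDOW, N_CAUSES))  # age x year x cause
x = layers.Conv2D(32, (3, 3), padding="same", activation="relu")(inputs)
x = layers.Conv2D(64, (3, 3), padding="same", activation="relu")(x)
x = layers.GlobalAveragePooling2D()(x)
outputs = layers.Dense(N_AGES)(x)  # next-year (log) mortality rate per age

model = tf.keras.Model(inputs, outputs)
model.compile(optimizer="adam", loss="mse")  # MSE matches the evaluation metric

# Dummy arrays just to show the training call
X = np.random.rand(256, N_AGES, WINDOW, N_CAUSES).astype("float32")
y = np.random.rand(256, N_AGES).astype("float32")
model.fit(X, y, epochs=2, batch_size=32, validation_split=0.2)
```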
[Outcome]
The new model outperformed most of the compared models, with a smaller total MSE. Its training time was significantly longer than the other models', but the priority of this criterion depends on the (business) context. I presented the results to UNSW professors and research fellows and achieved high distinction.
[Keyword]
Non-Profit Organization, 1st place winner, Internship, Supervised Machine Learning, Data Processing, Exploratory Data Analysis, Data Visualization, SAS Enterprise Miner
[Overview]
Please see https://hds-hub.cbdrh.med.unsw.edu.au/posts/2023-01-13-sas-cortex/ for details. It covers:
• winning 1st place in the SAS Cortex Analytics Simulation 5-Day Challenge in April 2022,
• participating in an internship program at SAS Institute Australia as a reward for the above, and
• achieving the SAS Certified Associate: Programming Fundamentals Using SAS Viya certification in June 2022.
[Keyword]
Socioeconomics, Statistical Modeling/Analysis, Generalized Linear Model (GLM), Exploratory Data Analysis, Data Visualization, R Programming
[Overview]
I analyzed the tendency of women's employment status with respect to their marital status, the income of the male member of the household, the presence of children, and the region of residence. The dataset was based on a 1977 survey of Canadian couples and families. I used a binomial-family GLM (logistic regression) since the outcome was binary; the inputs included both continuous and categorical variables.
[Approach]
• Exploratory data analysis (EDA) by observing the correlations
  ◦ between the outcome variable (i.e., women's employment status) and the input variables, and
  ◦ among the input variables
• Model fitting in various scenarios:
  ◦ using all input variables,
  ◦ using fewer variables,
  ◦ using all input variables, one of which interacts with the other input variables, and
  ◦ using fewer variables, one of which interacts with the remaining input variables
• Evaluating the models based on the ANOVA test p-value, AIC, and AUROC (a minimal sketch of one fit follows this list)
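A minimal sketch of one such fit, assuming a Python statsmodels workflow (the original analysis used R); the dummy rows and column names are hypothetical stand-ins for the 1977 survey variables.

```python
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

# Dummy rows standing in for the 1977 survey; column names are assumptions
df = pd.DataFrame({
    "working":  [1, 0, 1, 0, 1, 1, 0, 0] * 20,       # employment status
    "hincome":  [9, 15, 7, 23, 10, 13, 20, 9] * 20,  # male member's income
    "children": ["absent", "present", "absent", "present"] * 40,
    "region":   ["Ontario", "Quebec", "BC", "Prairie"] * 40,
})

# Binomial-family GLM; the income-by-children interaction illustrates the
# "one variable interacts with the others" scenarios
model = smf.glm("working ~ hincome * children + region",
                data=df, family=sm.families.Binomial()).fit()
print(model.summary())    # coefficients and p-values
print("AIC:", model.aic)  # one of the model-comparison criteria
```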
[Outcome]
The analysis implied
• a strong association between the presence of children and women's employment status, and
• a slight association between the male member's income and women's employment status.
I achieved high distinction for this task.
[Keyword]
Public Health, Interrupted Time Series, ARIMA, Exploratory Data Analysis, Data Visualization, R Programming
[Overview]
I analyzed the impact of media attention on the dispensing of contraceptives (combined and simple contraceptives) using R. The dataset consisted of monthly rates (per 1,000 women of reproductive age) of PBS-subsidized dispensing of combined and simple contraceptives between January 2013 and December 2016. Media attention peaked in the last week of May 2015.
[Approach]
• Exploratory data analysis (EDA) by decomposing each time series (i.e., combined or simple) to observe trend, seasonality, outliers, stationarity, and autocorrelation
• Log transformation of the data to mitigate autocorrelation and non-stationarity effects
• Model selection for each series based on the EDA (e.g., stationarity + no autocorrelation → segmented time series; non-stationarity + autocorrelation → ARIMA)
• Model fitting for each time series by iteratively testing different parameters
• Evaluation of time-series changes: the step (the interruption) and the slope after the media attention (the intervention)
• Quantifying the above changes in tables and visualizing the actual time series against the counterfactual (a simulated plot as if no intervention were present); a minimal sketch follows this list
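A minimal sketch of the interrupted-time-series ARIMA fit, assuming Python's statsmodels (the original analysis used R); the dummy series, intervention-term construction, and model orders are illustrative assumptions.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm

# Dummy monthly series standing in for the log dispensing rates (2013-2016)
idx = pd.date_range("2013-01-01", "2016-12-01", freq="MS")
rng = np.random.default_rng(0)
y = pd.Series(np.log(15 + rng.normal(0, 0.5, len(idx))), index=idx)

# Step (interruption) and slope terms starting after the May 2015 media peak
t0 = pd.Timestamp("2015-06-01")
months_since = (idx.year - t0.year) * 12 + (idx.month - t0.month)
X = pd.DataFrame({"step": (idx >= t0).astype(int),
                  "slope": np.clip(months_since + 1, 0, None)}, index=idx)

# ARIMA with the intervention terms as exogenous regressors
model = sm.tsa.SARIMAX(y, exog=X, order=(1, 0, 0),
                       seasonal_order=(1, 0, 0, 12)).fit(disp=False)
print(model.summary())  # step/slope coefficients quantify the changes
```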
Sample image: PBS-subsidised dispensing of the combined contraceptives (log of monthly rates per 1,000 women of reproductive age)
[Outcome]
The media impact on the combined contraceptives was evident from the percentage step change in the monthly dispensing rate and its confidence interval. I achieved distinction for this task.
[Keyword]
Hospital Operation, Supervised Machine Learning, Random Forest, Logistic Regression, Machine Learning Pipeline, Feature Transformation, Hyperparameter Tuning, Data Visualization, Python
[Overview]
I developed algorithms to predict the risk of diabetic patients' readmission to a hospital after discharge. The scenario was deploying the prediction algorithm for a hospital home-visit care unit, given that the operating cost is higher for readmitted patients. The dataset was simulated electronic health record data with binary readmission labels (i.e., yes or no), provided as "clean" data for this task. The algorithms used were logistic regression and random forest.
[Approach]
• Train/test split of the dataset, stratified on the labels (target variable)
• Developing the following pipeline for the logistic regression algorithm due to its sensitivity to value scales:
  ◦ feature transformation
  ◦ training/validation
  ◦ hyperparameter tuning
• Fitting (training/validation) with GridSearchCV from the scikit-learn library
• Model evaluation by F1 score (i.e., 2 x (precision x recall) / (precision + recall))
• Observing the feature variables with SHAP (for overall feature importance) and LIME (for the feature importance of a single sample's prediction); a minimal sketch of the pipeline follows this list
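A minimal sketch of the pipeline and grid search, assuming scikit-learn; the dummy data, feature names, and hyperparameter grid are illustrative assumptions rather than the task's exact configuration.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Dummy rows standing in for the cleaned EHR extract
X = pd.DataFrame({"age": [55, 63, 70, 48] * 25,
                  "num_medications": [5, 12, 8, 3] * 25,
                  "admission_type": ["ER", "elective", "ER", "urgent"] * 25})
y = [0, 1, 1, 0] * 25  # readmission labels

pre = ColumnTransformer([
    ("num", StandardScaler(), ["age", "num_medications"]),  # scaling for LR
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["admission_type"]),
])
pipe = Pipeline([("pre", pre), ("clf", LogisticRegression(max_iter=1000))])

# Stratified split on the labels, then F1-scored grid search
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=42)
grid = GridSearchCV(pipe, {"clf__C": [0.01, 0.1, 1, 10]}, scoring="f1", cv=5)
grid.fit(X_tr, y_tr)
print(grid.best_params_, grid.score(X_te, y_te))
```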
[Outcome]
I chose the random forest algorithm as the final model, given its higher F1 score (0.6706). Although the F1 score was not high, I concluded that the random forest model was deployable given its high precision on the test data and the 81% higher cost efficiency of home visits per patient. I achieved distinction for the task.
[Keyword]
Health Analytics, Decision Support, Supervised Machine Learning, Logistic Regression, Random Forest, Gradient Boosting Machine (GBM), Artificial Neural Network (ANN), Ensemble Models, Machine Learning Pipeline, Feature Transformation, Hyperparameter Tuning, Data Visualization, Python
[Overview]
I developed machine learning models (the final model serving as a decision support algorithm) for early-stage screening of Parkinson's disease patients. The dataset consisted of 252 subjects (188 patients and 64 controls) with 3 records per subject. The algorithms used were logistic regression, random forest, GBM, ANN, and ensemble models (voting classifiers).
[Approach]
• Exploratory data analysis / data preprocessing (e.g., data type checks, categorical variable checks for encoding, balance between patient and control numbers, train/test split)
• Developing models per algorithm (with pipeline construction where necessary, e.g., for logistic regression)
• Model evaluation by AUROC, recall, and F1 score for prediction performance and over/underfitting, with minimizing the false negative rate as the top priority; a minimal sketch of the ensemble step follows this list
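A minimal sketch of the voting-classifier ensemble, assuming scikit-learn; the estimators, dummy data, and settings are illustrative assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (GradientBoostingClassifier,
                              RandomForestClassifier, VotingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score, recall_score, roc_auc_score
from sklearn.model_selection import train_test_split

# Dummy imbalanced data mimicking 188 patients vs. 64 controls
X, y = make_classification(n_samples=252, weights=[0.25], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

ens = VotingClassifier(
    estimators=[("lr", LogisticRegression(max_iter=1000)),
                ("rf", RandomForestClassifier(random_state=0)),
                ("gbm", GradientBoostingClassifier(random_state=0))],
    voting="soft")  # soft voting averages the predicted probabilities
ens.fit(X_tr, y_tr)

proba = ens.predict_proba(X_te)[:, 1]
pred = ens.predict(X_te)
print("AUROC:", roc_auc_score(y_te, proba))
print("recall:", recall_score(y_te, pred))  # false negatives are the priority
print("F1:", f1_score(y_te, pred))
```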
[Outcome]
Based on the model evaluation criteria above (in priority order), the random forest model was the best (AUROC: 0.8300, recall: 0.97, F1: 0.8706) and likely viable only for preliminary screening purposes. I achieved distinction for this task.
[Keyword]
Medical Image Analysis, Unsupervised Machine Learning, Autoencoder, Neural Network, Data Visualization, Python
[Overview]
I developed a classification model for images of blood cells infected or uninfected by malaria using Python. The image data was provided as compressed pixel data, with labels for the images (13,779 each of infected and uninfected).
[Approach]
• Developing an autoencoder (feed-forward neural network-based, unsupervised) and assessing its power to distinguish the images while training on only a limited portion of the whole dataset (8,819 uninfected cell images), aiming to develop a model faster and with less training data (a minimal sketch follows below)
• Assessing the performance through several visualizations: a direct comparison of the actual vs. reconstructed images and a t-SNE cluster plot
Illustration of an Autoencoder
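A minimal sketch of such a feed-forward autoencoder in Keras; the image size, layer widths, and dummy data are illustrative assumptions.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

DIM = 64 * 64 * 3  # assumed flattened RGB cell image

inputs = tf.keras.Input(shape=(DIM,))
h = layers.Dense(256, activation="relu")(inputs)
h = layers.Dense(32, activation="relu")(h)           # bottleneck
h = layers.Dense(256, activation="relu")(h)
outputs = layers.Dense(DIM, activation="sigmoid")(h)

autoencoder = tf.keras.Model(inputs, outputs)
autoencoder.compile(optimizer="adam", loss="mse")

# Train on uninfected images only; infected images should then reconstruct
# poorly, separating the two classes by reconstruction error
x_uninfected = np.random.rand(1024, DIM).astype("float32")  # dummy data
autoencoder.fit(x_uninfected, x_uninfected, epochs=2, batch_size=64)

x_test = np.random.rand(10, DIM).astype("float32")
errors = np.mean((autoencoder.predict(x_test) - x_test) ** 2, axis=1)
print(errors)  # higher error suggests an unfamiliar (infected) image
```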
[Outcome]
I presented how my model could distinguish the images. I achieved distinction for the presentation.
[Keyword]
Health Analytics, Unsupervised Machine Learning, Autoencoder, Long Short-Term Memory (LSTM), Neural Network, Time-Series Data, Data Visualization, Python
[Overview]
I developed an autoencoder with LSTM layers for classifying electroencephalogram data from alcoholic and control subjects using Python. The dataset, from the UCI Machine Learning Repository, consisted of 122 subjects (120 trials per subject, each trial a 255-step time series).
[Approach]
• Narrowing the data to 30 patients with 30 trials each (20 patients with 20 trials for model training) due to the computational capacity of my platform (Google Colab)
• Loading and cleaning the data from .gz files (one per trial) into pandas data frames (saved as CSV)
• Constructing and training the autoencoder with LSTM cells in each layer (a minimal sketch follows this list)
• Assessing the autoencoder's predictions on test datasets of alcoholic and control patients
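A minimal sketch of an LSTM autoencoder in Keras; only the 255-step sequence length comes from the dataset, while the channel count, layer sizes, and dummy data are illustrative assumptions.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

STEPS, CHANNELS = 255, 8  # 255 time steps per trial; channel count assumed

model = tf.keras.Sequential([
    layers.LSTM(64, input_shape=(STEPS, CHANNELS)),  # encoder
    layers.RepeatVector(STEPS),            # repeat the latent vector per step
    layers.LSTM(64, return_sequences=True),          # decoder
    layers.TimeDistributed(layers.Dense(CHANNELS)),  # reconstruct each step
])
model.compile(optimizer="adam", loss="mse")

# Train on one group's trials only (dummy data shown here)
x_train = np.random.rand(128, STEPS, CHANNELS).astype("float32")
model.fit(x_train, x_train, epochs=2, batch_size=16)

# Distinct mean reconstruction errors between groups indicate separability
x_test = np.random.rand(8, STEPS, CHANNELS).astype("float32")
mse = np.mean((model.predict(x_test) - x_test) ** 2, axis=(1, 2))
print(mse)
```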
[Outcome]
The difference in mean squared errors between the alcoholic and control groups was distinct; hence, my autoencoder was able to distinguish the data. I achieved high distinction for this task.
[Keyword]
Health Analytics, Decision Support, Reinforcement Learning, Unsupervised Machine Learning, Batch-Constrained Q-Learning (BCQL), K-Means Clustering, Time-Series Data, Data Preprocessing, Data Visualization, Python
[Overview]
I developed a reinforcement learning algorithm (BCQL, hence model-free) for hypotensive patient management in the ICU using Python. The dataset consisted of vital signs, lab tests, and treatments measured over 48 hours in 3,910 patients with acute hypotension; no additional real-time data was available.
[Approach]
• Defining the reward function, which labels the mean arterial pressure (a vital sign) at the next time step
• Labeling the state (of each patient at each time point) by k-means clustering (i.e., unsupervised learning)
  ◦ The number of clustered states was set to 100 based on the Davies-Bouldin score (the lower, the better)
• Computing a tabular state-action (treatment) value function (i.e., the RL policy); a minimal sketch follows this list
• Evaluating the RL policy's performance against the clinical policy (i.e., simply the observed dataset) and a simple Q-learning policy (which is more biased toward the initialization values of the Q-table, hence less realistic)
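A minimal sketch of the batch-constrained idea in tabular form: unlike plain Q-learning, each update maximizes only over actions actually observed in the batch for that state. The transition format, action count, and hyperparameters are illustrative assumptions.

```python
import numpy as np

N_STATES, N_ACTIONS = 100, 18   # 100 k-means states; action count assumed
GAMMA, ALPHA, SWEEPS = 0.99, 0.1, 50

# Dummy (state, action, reward, next_state) transitions standing in for the
# clustered 48-hour ICU trajectories
rng = np.random.default_rng(0)
batch = [(int(rng.integers(N_STATES)), int(rng.integers(N_ACTIONS)),
          float(rng.random()), int(rng.integers(N_STATES)))
         for _ in range(5000)]

# The batch constraint: actions actually observed for each state
seen = {}
for s, a, _, _ in batch:
    seen.setdefault(s, set()).add(a)

Q = np.zeros((N_STATES, N_ACTIONS))
for _ in range(SWEEPS):
    for s, a, r, s2 in batch:
        # Max over observed actions only, unlike plain Q-learning
        nxt = max((Q[s2, b] for b in seen.get(s2, ())), default=0.0)
        Q[s, a] += ALPHA * (r + GAMMA * nxt - Q[s, a])

# Greedy policy, again restricted to actions seen in the batch
policy = {s: max(acts, key=lambda a: Q[s, a]) for s, acts in seen.items()}
print(list(policy.items())[:5])
```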
[Outcome]
The developed BCQL algorithm outperformed the clinical policy on the expected reward and was more realistic than simple Q-learning. I achieved high distinction for this task.
[Keyword]
PostgreSQL, SQL, Data Extraction, Data Retrieval
[Overview]
Using a relational schema (57 data tables) covering university information (e.g., people; program, course, and class enrolment; facilities; organizations) from my university, I developed PostgreSQL code to generate views, tables, and functions that take user inputs, per the task requirements.
[Outcome]
I achieved high distinction for this task.
[Keyword]
Public Health, Spatial Data, Data Visualization, Shiny App, R Programming
[Overview]
I developed an interactive Shiny app in R, mapping the 1,010 emergency departments (EDs) and coloring each ED's grid cell by the distance to the nearest other ED. The data source was the AIHW, which included each hospital's information (e.g., name, address, phone, webpage, longitude, and latitude). I set the Shiny app to show a blue boundary for the selected states; the state boundary data was provided in .shp format beforehand. Please note that the source code and the Shiny app are unavailable for publishing.
Sample Image of the ED mapping with distance coloring around Sydney
[Outcome]
The Shiny app worked as intended, as shown in the sample image above. I achieved high distinction for this task.
[Keyword]
Data Manipulation, Data Visualization, Master Data Management, Sales Forecast, Managerial Accounting, MS Excel
[Overview]
At a medical device company, I developed MS Excel-based analysis tools complementary to BI dashboards, for example:
• intragroup sales/cost forecasts by item (from the global headquarters to overseas affiliates),
• a master data check file for annual maintenance, and
• revenue/profit breakdown simulations under various currency rates.
The purpose was to enhance the flexibility of analysis operations while standardizing the tools enough for efficient and consistent usage. Functions used included VLOOKUP, HLOOKUP, INDEX, INDIRECT, SUBTOTAL, CONCATENATE, pivot tables, and pivot charts.
[Keyword]
BI Dashboard, Process Automation, Data Visualization, SAP BusinessObjects 4.0 Web Intelligence
[Overview]
At a medical device company, I developed and delivered a BI dashboard using SAP BusinessObjects 4.0 Web Intelligence. The responsibility started as a project to re-engineer financial data analysis/reporting, which initially relied on an entirely manual Excel process before the analysis, with frequent errors (e.g., incorrect copy/paste, inconsistent version control).
[Approach/Solution]
I provided the BI dashboard and complementary data extraction/checking tools to standardize and semi-automate the data preprocessing and to visualize the financial data (e.g., by time, subsidiary, country, and business field). The tool development process included:
• identifying user requirements,
• assessing raw data,
• modeling the to-be operational process,
• communicating with the system vendor,
• validating the system (data warehouse) developed by the vendor,
• developing the BI dashboard and complementary tools, and
• providing aftercare for users' ad-hoc requirements.
[Outcome]
• Although the data granularity was often insufficient for sophisticated analytical outputs, since the individual transaction-level data could not be shared with multiple stakeholders, the BI dashboard still delivered frequently missed observations (e.g., trends, irregularities, potentially incorrect data, false abnormalities).
• Through this project, I reduced the existing data analysis workload of stakeholders in Japan by 50%+, and I received an award from the Executive Vice President (head of division) for the improvement.
• Overall, I engaged in the project for three and a half years, including post-project engagement for ad-hoc requests (which took the majority of the time).
[Keyword]
Data Entry, Data Administration, Data Migration, Data Warehouse, MS Excel
[Overview]
At a medical device company, I engaged in data entry, administration, and migration as part of re-engineering the financial analysis operation through BI dashboard development and implementation.
[Issues/challenges]
The monthly financial reporting data (the base data) did not have the granularity required for complete analysis. Initially, the gathering and processing schemes for the "non-base" data were neither standardized nor automated (e.g., Excel sheets in inconsistent formats, calculation errors during consolidation).
[Approach/Solution]
• I developed a scheme and tools that partially standardized and automated the data entry and administration processes through templates and rules for routine data updates. Alongside the implementation, I cleansed the past "non-base" data to align with the templates and rules, then migrated it into the data warehouse from which the BI dashboard retrieved the data.
• I documented the data cleaning process in illustrated manuals and handed the task over to colleagues to establish the routine operation. Overall, I engaged in these tasks for around three years, including routine operation, handover, and supervision.
[Keyword]
Retail Business, Small Business Consulting, Financial Simulation, Managerial Accounting, Business Planning, Data Visualization, MS Excel, MS PowerPoint, MS Word
[Overview]
I consulted for a small company that initially feared having to terminate its business due to a significant staff shortage. I engaged in this project for nine months, in parallel with projects for other clients.
[Approach/Solution]
I analyzed their P/L and B/S numerically and visually to quantify the possibility, countermeasures, and potential risk of a cash flow (CF) shortage. This required me to estimate:
• the minimum viable revenue,
• the operating cost and cash flow corresponding to the above,
• the production capacity for sales at the store and to their distributors, and
• the condition of the client's staff members.
I then advised the client that they could maintain their business by:
• reducing business days/hours,
• temporarily discontinuing sales to their distributors,
• clarifying the time leeway before a cash flow shortage, and
• collaborating with another advisory team on job postings.
In addition, the key communication approaches I took with the client were:
• presenting only the details necessary to support the conclusion,
• documenting the details in graphs and tables with descriptions in simple language, helping the audience understand quickly and recap after my presentation,
• communicating interactively instead of simply explaining, and
• slowing down the explanation when the audience had difficulty understanding or was emotionally stressed.
[Outcome]
As a result, the client was able to maintain their business.