Extracted, cleaned and pre-processed over 13 million records from a remote SQL database. Trained an XGBoost valuation model on AWS EC2.
A guide on how to set up Spark with Jupyter on AWS EC2 instances with S3 I/O support. Presented at Toronto Apache Spark #19.
A solution for determining the optimal placement of location-based information maps throughout Toronto.
I use Python multiprocessing to preprocess lung CT images efficiently across all available CPU cores on AWS compute instances.
An exploration of satellite images using AWS S3 and boto3 for the Kaggle DSTL Satellite Imagery Feature Detection challenge.
* As a member of the University of Toronto Data Science Team (UDST).
How machine learning teams can apply Modern Agile and Extreme Programming engineering principles to deliver high-quality, flexible and low cost-of-change ML projects that yield a net reduction in development time and production time.
Technical presentation on Google's BERT, a state-of-the-art NLP model.
Practical and theoretical methodologies for applying deep learning to real-world applications, including public health sciences, based on techniques employed in real-world contexts.
MAT245: Mathematical Methods in Data Science: An introduction to the mathematical methods behind scientific techniques developed for extracting information from large data sets.
|3T0 M. & P. and Associates Scholarship||2018|
|Norman Stuart Robertson Scholarship in Mathematics||2017|
|Coxeter Scholarship in Mathematics||2017|
|C.L. Burton Scholarships for Mathematics and/or Physical Sciences||2017, 2016|
|Third Place at HackOn(Data)||2016|
|NSERC Undergraduate Student Research Award||2016 (UToronto)|
|Dr. James A. & Connie P. Dickson Scholarship in Science and Math||2015|
|University of Toronto Scholar
Joseph Alfred Whealey Incourse Scholarship
|Howard Ferguson Provincial Scholarship||2014
2015*, 2016*, 2017*
A list of useful data science resources.
What I’ve read.