Hey I'm Matthew! I follow Jesus and work in aviation embedded systems software at Dynetics. I previously worked as a Data Scientist in Baseball R&D @ Cleveland Guardians after finishing my EE PhD in 2023. From 2017-2022 I interned in spacecraft software development at NASA Johnson Space Center in Houston, TX.

I have two types of content here:

  • Learning resources that tackle specific topics and are iteratively improved
  • Blog posts that deal with specific example problems or tutorials

Posts

  • Is This A Bear?

    How would you explain whether an image contains a bear? Maybe you’d point out the ears, the snout, or the color of the fur. But how would you come up with a number that quantitatively describes the “bearyness” of the image?

  • Autopicking Available SLURM Queues

    I do a lot of my neural network training on a computing cluster at the University of Kentucky with GPUs. The cluster uses SLURM to submit jobs that run when enough resources (computer nodes) are available. This lets me run batches of experiments simultaneously and just watch the results from my desktop computer in the lab.

  • Launching Your Machine Learning Models with XGBoost(ers)

    Recently, I was asked to pick and explain a predictive machine learning technique for a job application. One such technique is gradient boosted decision trees. I’ll introduce decision trees and then what “boosting” means and how “gradient” fits into this. I’ll use a baseball example of predicting a fielder’s putout rate given the direction, launch angle, and exit velocity of a batted ball and then return to it in more concise and less technical way at the end.

  • How Many Bears Does It Take To Launch Into Space

    You want to go to space, but you want to do it your way. None of this rocket stuff. That’s already been done. No, you need an original way to hurl yourself into the sky. Obviously, your first thought is a seesaw. Those are made for just this sort of thing.

  • Yes, But What Exactly Is The Blockchain?

    The blockchain is a list of transactions. However, this list is not stored in one common place but is copied across many computers, none of which are considered the “true” copy. Each computer keeps its own version of the list, so the list is distributed and no single person can assume control. The trick is keeping all the computers in sync.

  • Deep Landscaping

    Deep learning can be split into three parts that must all work together to learn the desired task (like recognizing images, generating text, or detecting machine failures):

  • The Best Analysis of the Best First Guess in Wordle

    There has been much ado across the Internet lately about the best first guess for Wordle. I can’t review all the proposed words, but a few include roate, arose, soare, later, and adieu.

  • Let's Calculate a Lagrange Point For Fun

    The James Webb Space Telescope needs to be consistently shielded from the light and heat of the Sun and that reflected by the Earth. Ultimately, we need two things:

  • Attention, But Explained Like You're A Normal Human

    If you don’t have a deep learning background, everything about this will be confusing if it isn’t already.

    Sorry.

  • Pro(ish) Tips On How To Code Machine Learning Models

    With the amount of Python coding I do for machine learning, I have stumbled across some workflow tips that have greatly improved my standard of living. Check out these recommendations when developing your next breakthrough deep learning model.

  • You Should Use ISO 8601

    Consider the following from popular web comic XKCD:

  • So How Exactly Can I Meet My Goodreads Goal?

    After my previous post on simulating how many books I will read this year, I naturally asked myself, “Well, if 30 pages per day and taking 3 days to find another book to read won’t cut it, how fast should I read and how fast should I find new books if I want to meet my Goodreads goal?” Undoubtedly, you have also lost sleep over this question, wondering if there is any hope left for my literary aspirations. No longer. Welcome to the exciting conclusion of Matthew, Monte Carlo, and the Unrealistic Reading Goal.

  • Making Sense of the Embedded Landscape

    The world of embedded hardware and firmware is confusing.

  • Markov Chain Monte Carlo Sampling

    One of my courses recently introduced Markov Chain Monte Carlo (MCMC) sampling, which has a lot of applications. I’d like to dive into those applications in a future post, but for now let’s take a quick look at Metropolis-Hastings MCMC.

subscribe via RSS