Breaking the Jargons #4: August Edition
Machine learning pitfalls, cutecharts, Github features, and more…
Hi there!
Welcome to the fourth edition of the newsletter. In this newsletter, we'll look at a cool visualization library, get some Kaggle tips from a Taiwanese Grandmaster and look at ways to use Github more efficiently. There is also a guide to help newcomers avoid mistakes when using machine learning within an academic research context. To conclude, there are also a bunch of helpful resources mentioned.
📜Articles
Increase the cuteness quotient of your charts
Cutecharts is a Python library that renders interactive and hand-drawn charts in Python. It basically mimics the xkcd charts library but in python. So we get some cool-looking, handmade charts that are also interactive.
A Tour of 10 Useful Github Features
This article compiles some valuable tips and hacks that I have discovered over time while using Github. These have been gathered from various sources over time. I have filtered out the ones that were too familiar to avoid repetition. I’m sure you’ll find the list useful, and you might like to use them in your day-to-day work.
Different ways of getting datasets for your data science tasks
This article is a compilation of the various techniques to acquire datasets. For instance, there are some websites where you can find suitable datasets. Alongside there are ways to create your own datasets too. One can think of it as summarizing the multiple techniques while linking back to the original articles.
🎙️ Interviews
It was an absolute pleasure to interview Kunhao Yeh, a Kaggle competition Grandmaster from Taiwan. He shared how he initially wanted to become a professional GO player and later transitioned to machine learning and ultimately Kaggle. He shares that the people who were once his mentors became his colleagues.
🔬 Research Paper Summary
If you are beginning your journey in machine learning research, there are few salient points that, if kept in mind, will help you immensely. The following paper by Michael Lones is an excellent guide and a reminder to be aware of the pitfalls of machine learning for academic researchers.
Link to the paper: How to avoid machine learning pitfalls: a guide for academic researchers
💡 Concept corner
Chris Moffitt's "Effectively Using Matplotlib" is an excellent crash course into Matplotlib terminology and usage. This graphic from the same post is also super useful.
🎁 Resource of the Month
The PyCon US 2021 conference tutorials, talks, workshops, summits, and more are available on their YouTube channel and contain great sessions and presentations.
That is all for this edition. See you with another roundup next month. You can subscribe to receive the newsletter directly in your mailbox every month or share it with someone who could find them helpful.