data science portfolio

Posted

Benjamin Obi Tayo is a Ph.D., physicist, data scientist, educator and writer, and he has given permission for Smith Hanley to share his article on creating a data science portfolio with you. Utilize Benjamin’s great suggestions to improve your job search. Interested in more assistance on your job search?  Contact the Data Science and Analytics Recruiters at Smith Hanley Associates.

Data Science is a practical field. Hands-on skills are very important especially when you are interested in working outside academia as a practicing data scientist. In academia, you need more theoretical and research skills. While in-depth knowledge in the theoretical foundations of data science is important, as a practicing data scientist, hands-on experience is very crucial and one way to showcase your hands-on skills is via building a data science portfolio. Companies interested in hiring you would definitely be asking you for a portfolio, as it gives evidence of your strengths in fundamental data science concepts.

While a data science resume is important, a data science portfolio is more valuable than a resume. This article will discuss three important platforms that can be used for portfolio building. Before delving into the topic of building a good data science portfolio, let’s first discuss five reasons why a data science portfolio is important.

Five Reasons Why a Data Science Portfolio is Important

  1. A portfolio helps you showcase your data science skills.
  2. A portfolio enables you to network with other data science professionals and leaders in the field.
  3. A portfolio is good for bookkeeping. You can use it to keep a record of your completed projects, including datasets, codes, and sample output files. That way, if you have to work on a similar project, you can always use code that has already been written with only minor modifications.
  4. By building a portfolio and networking with other data science professionals and leaders, you are exposed to technological changes in the field. Data science is a field that is ever-changing due to advances in technology. To keep up with the latest changes and developments in the field, it is important to join a network of data science professionals.
  5. A portfolio increases your chances of getting a job. I’ve had numerous opportunities from LinkedIn through recruiters reaching out to me for job opportunities in data science.

Platforms for Building a Data Science Portfolio

1. GitHub

GitHub is a very useful platform for displaying your data science projects. This platform enables you to share your code with other data scientists or data science aspirants. Employers interested in hiring you would check your GitHub portfolio to assess some of the projects you’ve completed. It’s important for you to build a very strong and professional portfolio on GitHub.

To establish a GitHub portfolio, the first thing to do is to create a GitHub account. Once your account has been created, you may go ahead and edit your profile. When editing your profile, it is a good idea to add a short biography and a professional profile picture. Here is my GitHub profile.

Let’s assume that you’ve completed an important data science project and you would like to create a GitHub repository for your project. Here are a few tips for creating a repository: Make sure you choose a suitable title for your repository. Include a README file to provide a synopsis of what your project is all about. Upload your project files including the dataset, Jupyter Notebook, and sample outputs.

Here is an example of a GitHub repository for a machine learning project:

Repository Name: bot13956/ML_Model_for_Predicting_Ships_Crew_Size
Repository URL: https://github.com/bot13956/ML_Model_for_Predicting_Ships_Crew_Size
README File: ML_Model_for_Predicting_Ships_Crew_Size
Author: Benjamin O. Tayo
Date: 4/8/2019
We build a simple model using the cruise_ship_info.csv data set for predicting a ship’s crew size. This project is organized as follows: (a) data proprocessing and variable selection; (b) basic regression model; (c) hyper-parameters tuning; and (d) techniques for dimensionality reduction.
cruise_ship_info.csv: dataset used for model building.
Ship_Crew_Size_ML_Model.ipynb: the Jupyter Notebook containing code.

2. LinkedIn

LinkedIn is a very powerful platform for showcasing your skills and for networking with other data science professionals and organizations. LinkedIn is now one of the most utilized platforms for posting data science jobs and for recruiting data scientists. I’ve actually had two data science interviews via LinkedIn.

Make sure your profile is up-to-date at all times. List your data science skill sets, as well as your experiences, including projects that you’ve completed. It would be worthwhile to also list awards and honors. You also want to let recruiters know that you are actively searching for a job. On LinkedIn you want to keep up-to-date by following data science influencers and publications such as Towards Data Science and Towards AI. These companies post updates on interesting data science articles on various topics including machine learning, deep learning, and artificial intelligence.

Here is an example of my posts on LinkedIn.

3. Medium

Medium is now considered one of the fastest growing platforms for portfolio building and for networking. If you are interested in using this platform for data science portfolio building, the first step would be to create a Medium account. You can create a free account or a member account. With a free account, there are limitations on the number of member articles that you can actually access per month. A member account requires a monthly subscription fee of $5/month or $50/year. Find out more about becoming a Medium member here.

Once you’ve created an account, you can go ahead and create a profile. Make sure to include a professional picture and a short bio. Here is my Medium profile.

On Medium, a good way to network with other data science professionals is to become a follower. You can also follow specific Medium publications that are focused on data science. The two top data science publications are Towards Data Science and Towards AI. One of the best ways to enhance your portfolio on Medium is to become a medium writer.There are five reasons why you should consider writing data science articles on Medium.

  1. It provides a means for you to showcase your knowledge and skills in data science.
  2. It motivates you to work on challenging data science projects, thereby improving your data science skills.
  3. It enables you to improve your communication skills. This is useful because it enables you to convey information in a way that the general public can understand.
  4. Every article published on Medium is considered intellectual property, so you can add a medium article to your resume.
  5. You can make money from your articles. By means of the Medium Partner Program, anyone who publishes on Medium can make their articles eligible for earning money.

If you are interested in becoming a data science medium writer, here are some resources that can get you started:Beginner’s Guide to Writing Data Science Blogs on Medium and Choose the Right Featured Image For Your Data Science Articles.

In summary, we’ve discussed three important platforms that could be used for building a data science portfolio. A data science portfolio is a very important way for you to showcase your skills and to network with other data science professionals. A good portfolio will not only help you keep up-to-date with new developments in the field, but it will help increase your visibility to potential recruiters.


Leave a Reply

Your email address will not be published. Required fields are marked *