Jason Drummond · Categorical Data · One of the biggest and most important steps in almost any data science project is the data preparation phase, which includes loading and… · Apr 22, 2021
Jason Drummond · SQL Series: Subqueries · A subquery in SQL is a query that is nested inside another query. This simply means that we will have an additional SELECT statement… · Apr 20, 2021
Jason Drummond · Data Collection · We generate huge amounts of data every day simply by using our phones, playing a YouTube video, or paying for a… · Apr 13, 2021
Jason Drummond · Python: for loops · In Python, we use a for loop to iterate over a sequence; generally this sequence is a list, dictionary, tuple, set, or string… · Apr 6, 2021
Jason Drummond · Confusion Matrix NOT Confusing Matrix · Confusion matrices are a useful tool for evaluating binary classification models, and they can even be extended to multiclass classification… · Mar 30, 2021
Jason Drummond · Ventures into Visualizations: Seaborn · Today we will be continuing our series on visualization tools, specifically focusing on the Seaborn library. As a reminder, we went over… · Mar 23, 2021
Jason Drummond · SQL Series: CASE · This is the fifth part in my series teaching the basics of SQL. As a reminder, in parts 1–4 we went over a few of the most common… · Mar 15, 2021
Jason Drummond · Aggregating Data with Pandas · We have seen the importance of the pandas library and its impact on data manipulation and visualization for data scientists. We can also… · Mar 8, 2021
Jason Drummond · Data Transformations with Pandas · Pandas is an immensely useful Python package that is used mainly for data manipulation but can also be used for data visualization. As we have… · Mar 2, 2021