Data Science Related Tutorials

By Data Science Renee | Items I find as @becomingdatasci that look like good instructional resources for learning data science techniques (tutorials, explainers, references, etc.).

Emulating R plots in Python

Jul 11, 2017 | 6 minute read | Update: Cook’s distance lines on last plot, and cleaned up the code a bit! Recently, as a part of my Summer of Data Science …

Data Science

Jupyter Notebook Viewer

Possible outcomes of a classification task and the confusion matrix. If we are in a binary classification problem where the two classes are $1$ (call …

Data Science

Neural Networks from Scratch (in R)

This post is for those of you with a statistics/econometrics background but not necessarily a machine-learning one and for those of you who want some …

Deep Learning

Getting Started with tidyverse in R

The tidyverse is a collection of R packages developed by RStudio’s chief scientist Hadley Wickham. These packages work well together as part of …

Data Science

Mapping County Demographic Data in R

Ari Lamstein, a technology consultant and …

Data Analysis of IMDB Data

We are all surrounded by data, and it reveals a lot that helps us make decisions and recommends next steps. Data is collected from …

Static and dynamic network visualization with R

[June 2017 update] This tutorial is continuously updated and expanded. The latest version includes additional information (more on multiplex graphs, …

Data Visualization

R Data Science Tutorials

This repo contains a curated list of R tutorials and packages for Data Science, NLP and Machine Learning. This also serves as a reference guide for …

How A Data Scientist Can Improve Productivity

By Dmitry Petrov, @FullStackML. Data science and machine learning are iterative processes. It is never possible to successfully complete a data …

R powered web applications with Shiny (a tutorial and cheat sheet with 40 example apps)

Shiny at its simplest: In its simplest form, a Shiny application requires a server function to do the calculations and a user interface. Below we have …

JavaScript

You Must Allow Me To Tell You How Ardently I Admire and Love Natural Language Processing

It is a truth universally acknowledged that sentiment analysis is super fun, and Pride and Prejudice is probably my very favorite book in all of …

Data Science

A Better Way to Code

Introducing d3.express: the integrated discovery environment. If you’ve ever gotten frustrated trying to figure out why your code doesn’t work, or how …

H-1B Visa Petitions Data Analysis using R (Part II): Data Analysis

This post is part of a series of blog posts exploring the public H-1B visa petitions dataset using the R language. Part I: Data Wrangling. Part II: Data …

Data Science

NBA Foul Calls and Bayesian Item Response Theory

Posted on April 4, 2017. (Author’s note: many thanks to Robert ([@atlhawksfanatic](https://twitter.com/atlhawksfanatic) on Twitter) for pointing out …

Data Science

Playing with dimensions: from Clustering, PCA, t-SNE... to Carl Sagan!

Hi there! This post is an experiment combining the result of t-SNE with two well-known clustering techniques: k-means and hierarchical. This will be the …

Data Science

How to mine newsfeed data and extract interactive insights in Python

The web is an overcrowded space of data. In fact, you will find it in different shapes and formats, from simple tabular sheets like Excel files to …

Data Science

Learning AI if You Suck at Math — P4 — Tensors Illustrated (with Cats!)

Welcome to part four of Learning AI if You Suck at Math. If you missed parts …

Deep Learning

Beginner’s Guide to Customer Segmentation

At the core of customer segmentation is being able to identify different types of customers and then figure out ways to find more of those …

What's Wrong With My Time Series

What’s wrong with my time series? Model validation without a hold-out set. Time series modeling sits at the core of critical business operations such …

Data Science

Convolutional neural net for teeth detection

In this blog post, you will learn how to create a complete machine learning pipeline that solves the problem of telling whether or not a person in a …

Deep Learning

Examples

Basic Without Scales; Tooltips; Responsive with Types and Hover (example showing how to dynamically change annotation types); Reimagining the Circle …

Creating Scatter plot Matrix in Tableau

In this article we are going to learn to create a scatter plot matrix for the chosen dataset. A scatter plot matrix is a great way to roughly determine …

Data Science

Exploring the Common Crawl with Python

Common Crawl is a nonprofit organization that crawls the web and provides the contents to the public free of charge and under few restrictions. The …

R for Excel Users

Poor Donald – his tweets keep getting more negative

(This article was first published on jacobsimmering.com, and kindly contributed to R-bloggers) Last summer, David Robinson did this interesting text …

Fun With Plotly

Recently I have been exploring flexdashboards to visualize data. In this post I want to focus on a tool I’ve found particularly useful, plotly. Plotly …

Data Science

Getting Started With R in RStudio Notebooks

R is a powerful statistical programming language for manipulating, graphing, and modeling data. One of the major positive aspects of R is that it’s …

Data Science

David Meza

Graphing a Lesson Learned Database for NASA Using Neo4j, R/RStudio & Linkurious. Ask any project manager and they will tell you the importance of …

Data Science

Python for Data Analysis Part 24: Hypothesis Testing and the T-Test

Point estimates and confidence intervals are basic inference tools that act as the foundation for another inference technique: statis...