Data Science Related Tutorials

By Data Science Renee | Items I find as @becomingdatasci that look like good instructional resources for learning data science techniques (tutorials, explainers, references, etc.).

Emulating R plots in Python

Jul 11, 2017. 6 minute read. Update: Cook’s distance lines on last plot, and cleaned up the code a bit! Recently, as a part of my Summer of Data Science …

Jupyter Notebook Viewer

Possible outcomes of a classification task and the confusion matrix. If we are in a binary classification problem where the two classes are $1$ (call …

Neural Networks from Scratch (in R)

This post is for those of you with a statistics/econometrics background but not necessarily a machine-learning one and for those of you who want some …

Getting Started with tidyverse in R

The tidyverse is a collection of R packages developed by RStudio’s chief scientist Hadley Wickham. These packages work well together as part of …

Mapping County Demographic Data in R

Ari Lamstein, a technology consultant and …

GIS

Data Analysis of IMDB Data

Data Analysis

Static and dynamic network visualization with R

[June 2017 update] This tutorial is continuously updated and expanded. The latest version includes additional information (more on multiplex graphs, …

Science

R Data Science Tutorials

This repo contains a curated list of R tutorials and packages for Data Science, NLP and Machine Learning. This also serves as a reference guide for …

How A Data Scientist Can Improve Productivity

By Dmitry Petrov, @FullStackML. Data science and machine learning are iterative processes. It is never possible to successfully complete a data …

Data Science

R powered web applications with Shiny (a tutorial and cheat sheet with 40 example apps)

Shiny at its simplest. In its simplest form, a Shiny application requires a server function to do the calculations and a user interface. Below we have …

You Must Allow Me To Tell You How Ardently I Admire and Love Natural Language Processing

It is a truth universally acknowledged that sentiment analysis is super fun, and Pride and Prejudice is probably my very favorite book in all of …

A Better Way to Code

Introducing d3.express: the integrated discovery environment. If you’ve ever gotten frustrated trying to figure out why your code doesn’t work, or how …

H-1B Visa Petitions Data Analysis using R (Part II): Data Analysis

This post is part of a series of blogs on exploration of H-1B visa petitions public dataset using R language. Part I: Data Wrangling. Part II: Data …

NBA Foul Calls and Bayesian Item Response Theory

Posted on April 4, 2017. (Author’s note: many thanks to Robert ([@atlhawksfanatic](https://twitter.com/atlhawksfanatic) on Twitter) for pointing out …

Playing with dimensions: from Clustering, PCA, t-SNE... to Carl Sagan!

Hi there! This post is an experiment combining the result of t-SNE with two well known clustering techniques: k-means and hierarchical. This will be the …

How to mine newsfeed data and extract interactive insights in Python

The web is an overcrowded space of data. In fact, you will find it in different shapes and formats, from simple tabular sheets like excel files to …

Learning AI if You Suck at Math — P4 — Tensors Illustrated (with Cats!)

Welcome to part four of Learning AI if You Suck at Math. If you missed parts …

Beginner’s Guide to Customer Segmentation

This post originally appeared on the Yhat blog. Yhat is a Brooklyn-based company whose goal is to make data science applicable for developers, data …

Data Science

What's Wrong With My Time Series

What’s wrong with my time series? Model validation without a hold-out set. Time series modeling sits at the core of critical business operations such …

Data Science

Convolutional neural net for teeth detection

In this blog post, you will learn how to create a complete machine learning pipeline that solves the problem of telling whether or not a person in a …

#Examples

#Basic Without Scales. #Tooltips. #Responsive with Types and Hover. Example showing how to dynamically change annotation types. #Reimagining the Circle …

Creating Scatter plot Matrix in Tableau

In this article we are going to learn to create a scatter plot matrix for the chosen dataset. A scatter plot matrix is a great way to roughly determine …

Data Science

Exploring the Common Crawl with Python

Common Crawl is a nonprofit organization that crawls the web and provides the contents to the public free of charge and under few restrictions. The …

R for Excel Users

Data Science

Poor Donald – his tweets keep getting more negative

(This article was first published on jacobsimmering.com, and kindly contributed to R-bloggers) Last summer, David Robinson did this interesting text …

Fun With Plotly

Recently I have been exploring flexdashboards to visualize data. In this post I want to focus on a tool I’ve found particularly useful, plotly. Plotly …

Getting Started With R in RStudio Notebooks

R is a powerful statistical programming language for manipulating, graphing, and modeling data. One of the major positive aspects of R is that it’s …

David Meza

Graphing a Lesson Learned Database for NASA Using Neo4j, R/RStudio & Linkurious. Ask any project manager and they will tell you the importance of …

Python for Data Analysis Part 24: Hypothesis Testing and the T-Test

Point estimates and confidence intervals are basic inference tools that act as the foundation for another inference technique: statis...