vish

69 Flips | 1 Magazine | 3 Following | 1 Follower | @mvish | Keep up with vish on Flipboard, a place to see the stories, photos, and updates that matter to you. Flipboard creates a personalized magazine full of everything, from world news to life’s great moments. Download Flipboard for free and search for “vish”

Designing a Data Architecture to Support both Fast and Big Data

<i>Originally written by Scott Jarr for VoltDB.</i><p>In post one of this series, we introduced the ideas that a Corporate Data Architecture was taking shape …

Kaggle Competition Past Solutions

[edit: last update at 2014/06/27. My apologies, have been very busy the past few months.]<p>We learn more from code, and from great code. Not …

Data Science 101: Real-time Analytics using Cassandra, Spark and Shark - insideBIGDATA

In the video below, Evan Chan (Software Engineer at Ooyala), describes his experience using the Spark and Shark frameworks for running real-time …

Lionel Messi Is Impossible

In their Group F World Cup match late last month, Argentina and Iran were still deadlocked after 90 minutes. With the game in stoppage time and the …

Soccer

Unlocking Big Data's Value Potential Through Design with Small Data

<i>Co-written by Finn Birger Lie</i><p>Big Data is perceived as the next value opportunity for corporations to innovate and grow. To be of any real value, however, Big Data has to become more accessible and understandable to non-specialist users and leave the domain of the data specialists.<p>The advent of …

Everything You Wanted to Know About Machine Learning, But Were Too Afraid To Ask (Part One)

Recently, Professor Pedro Domingos, one of the top machine learning researchers in the world, wrote a great article in the Communications of the ACM …

Cluster Computing for $0.27/hr using Amazon EC2 and IPython Notebook

<i>This is a guest post by Randy Zwitch (@randyzwitch), a digital analytics and predictive modeling consultant in the Greater Philadelphia area. Randy</i> …

Philip Guo - Data Science Workflow: Overview and Challenges

Data Science Workflow: Overview and Challenges<p>October 2013 (perspective of a postdoc)<p>During my Ph.D., I created tools for people who write programs …

Data Science

Supervised learning: predicting an output variable from high-dimensional observations — scikit-learn 0.19.1 documentation

Linear model: from regression to sparsity¶<p>Diabetes dataset<p>The diabetes dataset consists of 10 physiological variables (age, sex, weight, blood …

Where everyone in the world is migrating—in one gorgeous chart

It’s no secret that the world’s population is on the move, but it’s rare to get a glimpse of where that flow is happening. In a study released in Science, a team of geographers used data snapshots to create a broad analysis of global migrations over 20 years.<p>The study was conducted by three …

Know this right now about Hadoop

I write a lot about Hadoop, for the obvious reason that it's the biggest thing going on right now. Last year everyone was talking about it -- and …

Big data's bogus correlations

<b>Detangling the truth from the spurious, the curious and the injurious</b><p>Coincidences are correlations, of course. However, they're the weakest possible …

Data Analytics at eBay

In the blog post Using Spark to Ignite Data Analytics eBay's analytics team explains how Apache Spark fits into its analytic data …

10 things statistics taught us about big data analysis

<b>If the goal is prediction accuracy, average many prediction models together</b>. In general, the prediction algorithms that most frequently win Kaggle …

An excellent introduction to MapReduce and Hadoop

by Yanchang Zhao, RDataMining.com<p>The lectures in week 3 of a free online course Introduction to Data Science give an excellent introduction to …

ŷhat | 10 Books for Data Enthusiasts

Synopsis<p>An overview of machine learning and the key algorithms in use today. Each chapter outlines a problem, defines an approach to solving it using …

How-to: Select the Right Hardware for Your New Hadoop Cluster

One of the first questions Cloudera customers raise when getting started with Apache Hadoop is how to select appropriate hardware for their new …

Out in the Open: Build Your Own Netflix-Style Suggestion Machine for Free

Netflix has spent years building and improving its recommendation engine, and even sponsored a $1 million contest to improve its algorithm. But now …

Home

SECURITY<p>A primary concern, enterprises can look to AXON Platform as a proven model to prevent potential threats to their business.<p>MEDIA<p>AXON Platform …

When machine learning takes a lesson from human learning

Advertisement<p>2 Comments<p>If you haven’t heard Ramona Pierson’s story, you’re missing out on a great source of inspiration. It involves a broken home, a …

With 40% Of U.S. Doctors Signed On, Doximity’s Jeff Tangney Reveals How The Social Network For M.D.s Hit The Tipping Point

With the arrival of Obamacare, millions of uninsured Americans are entering the healthcare system for the first time. As these new patients happily stream into waiting rooms, doctors are scrambling to keep pace with the increasing demand. Preserving a high standard of care amidst the waiting room …

How Big Data Analytics is Aiding Search for Flight 370

As the hours and days go by following the sudden and mysterious disappearance of Malaysia Airlines Flight 370 somewhere in Southeast Asia, more …

Learn-to-code company Treehouse gives students a browser-based text editor

You don’t need a separate, offline text editor whenever you’re learning to code with Treehouse.<p>The code-education service announced today it now offers a browser-based developing environment for CSS, HTML, and JavaScript.<p>That means you, the student, have less to worry about — no need to think about …

How to build your own Facebook Sentiment Analysis Tool

• February 3, 2014<br>• Vasilis Vryniotis<br>• . 7 Comments<p>In this article we will discuss how you can build easily a simple Facebook Sentiment Analysis tool …

Machine Learning

A key investor walks you through the big-data technology stack (video)

So far, big data startups continue to command hefty funding rounds in 2014, just like they did last year. If you want to know why, watch this video.<p>In a new video, Jake Flomenberg of Accel Partners lays out his view of the big data market and the investing opportunities he’s excited about. He’s …

An Introduction to D3

by Sam Selikoff February 24, 2014<p>D3.js is a JavaScript library used to create interactive visualizations in the browser. This tutorial discusses some …