Markus Schaber

212 Flips | 2 Magazines | 10 Followers | @MarkusSchab2017 | Keep up with Markus Schaber on Flipboard, a place to see the stories, photos, and updates that matter to you. Flipboard creates a personalized magazine full of everything, from world news to life’s great moments. Download Flipboard for free and search for “Markus Schaber”

Introductory Guide - Factorization Machines & their application on huge datasets (with codes in Python)

Introduction<p><i>I still remember my first encounter with a Click prediction problem. Before this, I had been learning data science and I was feeling good</i> …

A Beginner’s Guide to Data Engineering — Part I

Data Engineering: The Close Cousin of Data Science<p>Motivation<p>The more experienced I become as a data scientist, the more convinced I am that data …

Now available in Amazon SageMaker: DeepAR algorithm for more accurate time series forecasting | Amazon Web Services

Today we are launching Amazon SageMaker DeepAR as the latest built-in algorithm for Amazon SageMaker. DeepAR is a supervised learning algorithm for …

Machine Learning

Confessions of a former hacker: 5 techniques to make you more secure online

Consumers are daily targets of email and phone scams, not to mention the frequent cyberattacks on big data. So it’s never been more important to …

The future of FinTech is racist, according to this anonymous data scientist

<i>This is an excerpt from a long interview between an anonymous data scientist and</i> Logic Magazine <i>about AI, deep learning, FinTech, and the future, conducted in November 2016.</i><p><b>LOGIC:</b> <b>One hears a lot about algorithmic finance and things like</b> robo<b>-advisers. And I’m wondering, is it over-hyped?</b><p><b>DATA</b> …

Apache Kafka Introduction

development<p>6 Comments<p>Integrating systems that every day grow larger is a complex task. Apache Kafka is a software that tries to solve this by using …

TensorFlow for R

Time Series Forecasting with Recurrent Neural Networks<p>In this section, we’ll review three advanced techniques for improving the performance and …

Deep Learning

Short Post: The `future` is `fst`: loading and saving Fannie Mae Loan Acquisition data in 10 lines of code

The future is fst: loading and saving Fannie Mae Loan Acquisition data in 10 lines of code

Data Science

Pipes in R Tutorial For Beginners

(This article was first published on <b> R-posts.com</b>, and kindly contributed to R-bloggers)You might have already seen or used the pipe operator when …

Python Programming

Comprehensive Breakdown Of Mesothelioma Law Firm (All You Need To Know 2018)

For quite a long time, mesothelioma, a life-threatening disease that can influence the lungs, abdomen, and a few other real organs, has been …

Land Registry announces plans for increased open data

Subscribe for full access<p><b>Take out a print and online or online only subscription and you will get immediate access to:</b><p>Breaking industry news as it …

bayesplot

<b>bayesplot</b> is an R package providing an extensive library of plotting functions for use after fitting Bayesian models (typically with MCMC). Currently</b> …

House Price Prediction using a Random Forest Classifier - Data Blogger

In this blog post, I will use machine learning and Python for predicting house prices. I will use a Random Forest Classifier (in fact Random Forest …

Free eBook: Applied Data Science (Columbia University)

Published in 2013, but still very interesting, and different from most data science books. Authors: Ian Langmore and Daniel Krasner.. This book …

Data Science

Simple models in Kaggle competitions

This week I participated in the Porto Seguro Kaggle competition. Basically, you’re asked to predict a binary variable — whether or not an insurance …

Data Science

How to identify risky bank loans using C.50 decision trees

(This article was first published on <b>R-posts.com</b>, and kindly contributed to R-bloggers)<p><b>This tutorial has been taken from Machine Learning with R</b> …

Create Powerpoint presentations from R with the OfficeR package

For many of us data scientists, whatever the tools we use to conduct research or perform an analysis, our superiors are going to want the results as …

An Introduction to Spatial Econometrics in R

4 Spatial Econometrics in R<p>When dealing with space one must bear in mind Tobler’s first law of geography “Everything is related to everything else, …

Geospatial analysis in R

• Exercises<br>• Exercise 01 -- Getting R and RStudio<br>• Exercise 02 -- Univariate plots<br>• Exercise 03 -- Bivariate plots and descriptive statistics<br>• Exercise 04 -- …

Artificial Intelligence vs. Machine Learning vs. Deep Learning

Machine learning and artificial intelligence (AI) are all the rage these days — but with all the buzzwords swirling around them, it's easy to get …

How Random Forests improve simple Regression Trees?

(This article was first published on <b>R – insightR</b>, and kindly contributed to R-bloggers)<p><b>By Gabriel Vasconcelos</b><p>Regression Trees<p>In this post I am going …

TensorFlow for R

<i>This tutorial is intended for readers who are new to both machine learning and TensorFlow. If you already know what MNIST is, and what softmax</i> …

Data governance stumble

Data has become the lifeline that is necessary for most businesses to function with increased efficiency and operational performance. Organizational …

Forming a Data Analytics Practice

The Business Problem<p>The first thing to consider when starting an analytics practice is your core business. Are you in manufacturing or healthcare? …

Big Data

What does GDPR European Union law mean for your business?

Today’s consumers are more powerful than ever before, and get every bit of information that they can before they make a purchase. The Internet is helping them greatly, and most of the buying is done online. The pace is so rapid that it won’t be long before online purchases are more common than …

New EU project Decode wants us to reclaim our personal data for the common good

Led by the technology and innovation office in Barcelona, and delivered by a consortium of 14 European partners including Nesta, Decode is a …

Personal Data

Now you can take a Deep Learning course online, Coursera co-founder says barrier to entry has come down

Coursera’s co-founder and leading AI expert Andrew Ng, who created the certificate programme, told IndianExpress.com that that it is becoming …

Artificial Intelligence

Practical Data Science for Stats

PeerJ Preprints has recently published a collection of articles that focus on the practical side of statistical analysis: Practical Data Science for …

Data Science

How to Extract and Clean Data From PDF Files in R

Do you need to extract the right data from a list of PDF files but right now you’re stuck?<p>If yes, you’ve come to the right place.<p><i>Note: This article</i> …