Big Data for Geeks

By Rick Gansler | Technical articles about the Hadoop ecosystem. Discussions about the hardware, use cases, and applications.

The Defense Department’s data strategy: Huge, massive and distributed

Ely Kahn has spent more than a decade working in the national security world, including stints at the Transportation Safety Administration, …

Cloudera is rebuilding machine learning for Hadoop with Oryx

Hadoop software vendor Cloudera didn’t make a lot of waves when it bought a London-based startup called Myrrix last year, and it hasn’t made a lot of …

INTERVIEW: Marilyn Matz, CEO of Paradigm4 - insideBIGDATA

Paradigm4 is the company behind SciDB, a scalable array database with native complex analytics. CEO Marilyn Matz is an expert in the field of big …

Installing Hadoop on a #RaspberryPi

Hadoop is a framework, written in Java, for handling large datasets. According to the Apache website:<p>The Apache Hadoop software library is a …

Western Union deploys Cloudera Hadoop tools for transactional data analytics

Share<p>Global payments provider Western Union has implemented a Hadoop-based data analytics platform from Cloudera to help provide a more personalised …

Big Data

New Intel Big Data Platform Includes Analytics Toolkit for Developers

Intel Corp. last week announced a new platform that seeks to simplify Big Data analytics while improving upon the capabilities of straight Apache …

Teradata threads Q4 Hadoop needle for now

Teradata's fourth quarter earnings were solid, but analysts peppered management with questions about Hadoop as data warehouse revenue worries persist.<p>…

Out in the Open: Hacker Vows to Instantly Analyze Your Big Data

These days, Hadoop is everywhere.<p>It began as an esoteric data-crunching platform used by vanguard web companies like Yahoo, Facebook, and Twitter, …

Hydra is Now Open Source

Today we are happy to announce that Hydra—the core of our data processing platform—is now open source and available on github. It’s freely available …

Splunk Digs Into the Year of the Big Data Application

<b>Big Data: Not Just for Big Business Anymore</b><p>If 2013 was the year that most organizations discovered what <b>Big Data</b> platforms such as <b>Hadoop</b> were all …

SIEM, AIOps, Application Management, Log Management, Machine Learning, and Compliance

IT Operations<p>I see that one of my servers is down, has that impacted the health of my service?<p>How do I predict service-level degradation before it …

Splunk Debuts Hunk Analytics for Hadoop

While Hadoop provides an excellent framework for storing massive amounts of data, deriving any business value from all that information is quite …

Big Data

Hadoop-as-a-Service Provider Qubole Now Runs on Google Compute Engine

Qubole, a managed Hadoop-as-a-Service offering is now available on Google Compute Engine (GCE). Qubole was so far only available on Amazon’s AWS and …

Channel Partners

article<p>LiveAction's channel chief says advanced network monitoring can boost partners' SD-WAN engagements.<p>opinion<p>It's not too late to establish your …

Elastic

31 October 2013 News<p>Elasticsearch and Hortonworks Partner<p>What You Need to Know<p>We've got some exciting news to share around Elasticsearch and Hadoop. …

Fast Search and Analytics on Hadoop with Elasticsearch

This website is for sale!

Resource

Open source Jaspersoft BI links into Amazon Hadoop

As well as launching version 5.5 of its business intelligence platform, Jaspersoft has integrated its tools with Amazon's Elastic MapReduce Hadoop …

Pivotal puts more of its platform pieces together

Pivotal, the company spun out of EMC and VMware last year to build a next-gen data analytics platform, is putting more pieces of that platform …

Apache Ambari

OVERVIEW<p>A completely open source management platform for provisioning, managing, monitoring and securing Apache Hadoop clusters. Apache Ambari takes …

Ubiquitous Computing

The Future of Ambari

What a difference a year makes! Last Fall Ambari was a nascent Apache project that had recently shipped an inaugural release in the community. Fast …

Big Data

HyperDex 1.1: Backups, Macs and Graphs

We are proud to announce HyperDex 1.1, the next generation NoSQL data store that provides ACID transactions, fault-tolerance, and high-performance. …

Linux

Apache Cassandra

Proven<p>Cassandra is in use at Constant Contact, CERN, Comcast, eBay, GitHub, GoDaddy, Hulu, Instagram, Intuit, Netflix, Reddit, The Weather Channel, …

Ubiquitous Computing

Tarantool In Memory Data Grid

Voldemort is a distributed key-value storage system

• Data is automatically replicated over multiple servers.<br>• Data is automatically partitioned so each server contains only a subset of the total …

Databases