Big Data for Geeks

By Rick Gansler | Technical articles about the Hadoop ecosystem. Discussions about the hardware, use cases, and applications.

The Defense Department’s data strategy: Huge, massive and distributed

Ely Kahn has spent more than a decade working in the national security world, including stints at the Transportation Safety …

Cloudera is rebuilding machine learning for Hadoop with Oryx

Hadoop software vendor Cloudera didn’t make a lot of waves when it bought a …

INTERVIEW: Marilyn Matz, CEO of Paradigm4 - insideBIGDATA

Paradigm4 is the company behind SciDB, a scalable array database with native complex analytics. CEO Marilyn Matz is an expert in the field of big …

Installing Hadoop on a #RaspberryPi

Hadoop is a framework, written in Java, for handling large datasets. According to the Apache website: The Apache Hadoop software library is a …

Western Union deploys Cloudera Hadoop tools for transactional data analytics

Global payments provider Western Union has implemented a Hadoop-based data analytics platform from Cloudera to help provide a more personalised …

New Intel Big Data Platform Includes Analytics Toolkit for Developers

Intel Corp. last week announced a new platform that seeks to simplify Big Data analytics while improving upon the capabilities of straight Apache …

Teradata threads Q4 Hadoop needle for now | ZDNet

Teradata's fourth quarter earnings were solid, but analysts peppered management with questions about Hadoop as data warehouse revenue worries persist. …

Out in the Open: Hacker Vows to Instantly Analyze Your Big Data

These days, Hadoop is everywhere. It began as an esoteric data-crunching platform used by vanguard web companies like Yahoo, Facebook, and Twitter, …

Hydra is Now Open Source

Today we are happy to announce that Hydra, the core of our data processing platform, is now open source and available on GitHub. It’s freely available …

Splunk Digs Into the Year of the Big Data Application

Big Data: Not Just for Big Business Anymore. If 2013 was the year that most organizations discovered what Big Data platforms such as Hadoop were all …

Operational Intelligence, Log Management, Application Management, Enterprise Security and Compliance

Splunk® Enterprise: See the Forest and the Trees. Collect, analyze and act upon the untapped value of the big data generated by your technology …

Splunk Debuts Hunk Analytics for Hadoop

While Hadoop provides an excellent framework for storing massive amounts of data, deriving any business value from all that information is quite …

Hadoop-as-a-Service Provider Qubole Now Runs on Google Compute Engine

Qubole, a managed Hadoop-as-a-Service offering, is now available on Google Compute Engine (GCE). Until now, Qubole was available only on Amazon’s AWS and …

Channel Partners

Article: Savvy channel partners are learning how to differentiate between the many cloud marketplaces popping up in order to find... Opinion: Insiders can …

Elastic

October 31, 2013. Elasticsearch and Hortonworks Partner: What You Need to Know. We've got some exciting news to share around Elasticsearch and Hadoop. …

Fast Search and Analytics on Hadoop with Elasticsearch

Hortonworks customers can now enhance their Hadoop applications with Elasticsearch real-time data exploration, analytics, logging and search …

Open source Jaspersoft BI links into Amazon Hadoop | ZDNet

As well as launching version 5.5 of its business intelligence platform, Jaspersoft has integrated its tools with Amazon's Elastic MapReduce Hadoop …

Pivotal puts more of its platform pieces together

Pivotal, the company spun out of EMC and VMware last year to build a next-gen data analytics platform, is putting more pieces …

Computerworld India | news

NEXTDC closes in on Asia Pacific Data Centre deal. NEXTDC has announced a takeover offer to acquire all the securities of Asia Pacific Data Centre …

Apache Ambari

A completely open source management platform for provisioning, managing, monitoring and securing Apache Hadoop clusters. Apache Ambari takes …

The Future of Ambari

What a difference a year makes! Last Fall Ambari was a nascent Apache project that had recently shipped an inaugural release in the community. Fast …

HyperDex 1.1: Backups, Macs and Graphs

We are proud to announce HyperDex 1.1, the next generation NoSQL data store that provides ACID transactions, fault tolerance, and high performance. …

Linux

Apache Cassandra

Proven: Cassandra is in use at Constant Contact, CERN, Comcast, eBay, GitHub, GoDaddy, Hulu, Instagram, Intuit, Netflix, Reddit, The Weather Channel, …

Get your data in RAM. Get compute close to data. Enjoy the performance.

Tarantool - Get your data in RAM. Get compute close to data. Enjoy the performance.

Voldemort is a distributed key-value storage system

• Data is automatically replicated over multiple servers.
• Data is automatically partitioned so each server contains only a subset of the total …
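
For readers who want a feel for what the two Voldemort bullets above mean in practice, here is a minimal sketch of a partitioned, replicated key-value store. It is an illustration only and does not use Voldemort's actual API or its real placement scheme: the names `owners` and `TinyCluster`, the server labels, and the "hash the key, then take the next servers in list order" rule are all assumptions made for this example.

```python
import hashlib


def owners(key, servers, replicas=2):
    """Pick the servers that hold `key` (hypothetical placement rule).

    The key's hash selects a starting server; the following servers in
    list order hold the extra copies.
    """
    if not 1 <= replicas <= len(servers):
        raise ValueError("replicas must be between 1 and the number of servers")
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    start = int.from_bytes(digest[:8], "big") % len(servers)
    return [servers[(start + i) % len(servers)] for i in range(replicas)]


class TinyCluster:
    """In-memory stand-in for a cluster: one dict per server."""

    def __init__(self, servers, replicas=2):
        self.servers = list(servers)
        self.replicas = replicas
        self.stores = {name: {} for name in self.servers}

    def put(self, key, value):
        # Replication: write the value to every server that owns the key.
        for name in owners(key, self.servers, self.replicas):
            self.stores[name][key] = value

    def get(self, key):
        # Read from the first owning server that has the key.
        for name in owners(key, self.servers, self.replicas):
            if key in self.stores[name]:
                return self.stores[name][key]
        raise KeyError(key)


cluster = TinyCluster(["s1", "s2", "s3", "s4"], replicas=2)
for i in range(100):
    cluster.put(f"user:{i}", i)
```

Because each key is written to only two of the four servers, no single server needs to hold the full dataset, while losing one server still leaves a second copy of every key it held. A production system would also have to handle node membership changes, failures, and conflicting writes, which this sketch ignores.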