Big Data for Geeks

By Rick Gansler | Technical articles about the Hadoop ecosystem. Discussions about the hardware, use cases, and applications.

The Defense Department’s data strategy: Huge, massive and distributed

Advertisement<p>1 Comment<p>Ely Kahn has spent more than a decade working in the national security world, including stints at the Transportation Safety …

Cloudera is rebuilding machine learning for Hadoop with Oryx

Advertisement<p>2 Comments<p>Credit: Wikimedia Commons / Thepedestrian<p>Hadoop software vendor Cloudera didn’t make a lot of waves when it bought a …

INTERVIEW: Marilyn Matz, CEO of Paradigm4 - insideBIGDATA

Paradigm4 is the company behind SciDB, a scalable array database with native complex analytics. CEO Marilyn Matz is an expert in the field of big …

Installing Hadoop on a #RaspberryPi

Hadoop is a framework, written in Java, for handling large datasets. According to the Apache website:<p>The Apache Hadoop software library is a …

Western Union deploys Cloudera Hadoop tools for transactional data analytics

Share<p>Global payments provider Western Union has implemented a Hadoop-based data analytics platform from Cloudera to help provide a more personalised …

New Intel Big Data Platform Includes Analytics Toolkit for Developers

Intel Corp. last week announced a new platform that seeks to simplify Big Data analytics while improving upon the capabilities of straight Apache …

Teradata threads Q4 Hadoop needle for now

Teradata's fourth quarter earnings were solid, but analysts peppered management with questions about Hadoop as data warehouse revenue worries persist.<p>…

Out in the Open: Hacker Vows to Instantly Analyze Your Big Data

These days, Hadoop is everywhere.<p>It began as an esoteric data-crunching platform used by vanguard web companies like Yahoo, Facebook, and Twitter, …

Hydra is Now Open Source

Today we are happy to announce that Hydra—the core of our data processing platform—is now open source and available on github. It’s freely available …

Splunk Digs Into the Year of the Big Data Application

<b>Big Data: Not Just for Big Business Anymore</b><p>If 2013 was the year that most organizations discovered what <b>Big Data</b> platforms such as <b>Hadoop</b> were all …

Operational Intelligence, Log Management, Application Management, Enterprise Security and Compliance

Splunk Named a Leader in The Forrester Wave™: Security Analytics Platforms, Q1 2017<p>Recognized by Forrester for highest possible score in Real-Time …

Splunk Debuts Hunk Analytics for Hadoop

While Hadoop provides an excellent framework for storing massive amounts of data, deriving any business value from all that information is quite …

Hadoop-as-a-Service Provider Qubole Now Runs on Google Compute Engine

Qubole, a managed Hadoop-as-a-Service offering is now available on Google Compute Engine (GCE). Qubole was so far only available on Amazon’s AWS and …

Channel Partners Online

Opinions<p>February 24, 2017<p>Upheaval in the new administration is driving some companies out of global trade.<p>February 23, 2017<p>Customers are comfortable …

Elastic

31 October 2013 News<p>Elasticsearch and Hortonworks Partner<p>What You Need to Know<p>We've got some exciting news to share around Elasticsearch and Hadoop. …

Fast Search and Analytics on Hadoop with Elasticsearch

<b>Hortonworks customers can now enhance their Hadoop applications with Elasticsearch real-time data exploration, analytics, logging and search …

Page Redirection

Resource

Open source Jaspersoft BI links into Amazon Hadoop

As well as launching version 5.5 of its business intelligence platform, Jaspersoft has integrated its tools with Amazon's Elastic MapReduce Hadoop …

Pivotal puts more of its platform pieces together

Advertisement<p>1 Comment<p>Pivotal, the company spun out of EMC and VMware last year to build a next-gen data analytics platform, is putting more pieces …

Computerworld India | news

Industry leaders gear up for SAS Forum India 2017<p>The forum will focus on big data analytics, cloud, IoT, customer experience, digital and …

Apache Ambari

OVERVIEW<p>A completely open source management platform for provisioning, managing, monitoring and securing Apache Hadoop clusters. Apache Ambari takes …

The Future of Ambari

What a difference a year makes! Last Fall Ambari was a nascent Apache project that had recently shipped an inaugural release in the community. Fast …

HyperDex 1.1: Backups, Macs and Graphs

We are proud to announce HyperDex 1.1, the next generation NoSQL data store that provides ACID transactions, fault-tolerance, and high-performance. …

Linux

Apache Cassandra

Proven<p>Cassandra is in use at Constant Contact, CERN, Comcast, eBay, GitHub, GoDaddy, Hulu, Instagram, Intuit, Netflix, Reddit, The Weather Channel, …

Get your data in RAM. Get compute close to data. Enjoy the performance.

Tarantool - Get your data in RAM. Get compute close to data. Enjoy the performance.¶<p><hidden><p>Get your data in RAM. Get compute close to data. Enjoy …

Big Data

Voldemort is a distributed key-value storage system

• Data is automatically replicated over multiple servers.<br>• Data is automatically partitioned so each server contains only a subset of the total …