What is Big Data --- A complete Description
What is Big Data???
Big data is an elusive concept.
It represents an amount of digital information,
which is uncomfortable to store, transport, or
analyze. Big data is so voluminous that
it overwhelms the technologies of the day and
challenges us to create the next generation of
data storage tools and techniques. So,
big data isn’t new. In fact, physicists
at CERN have been rangling with the
challenge of their ever-expanding big data for decades. Fifty
years ago, CERN’s data could be stored in
a single computer. OK, so it wasn’t your
usual computer, this was a mainframe
computer that filled an entire building. To
analyze the data, physicists from around
the world traveled to CERN to connect to
the enormous machine. In the 1970’s, our
ever-growing big data was distributed
across different sets of computers, which
mushroomed at CERN.
During the early 2000’s,the
continued growth of our big data outstripped
our capability to analyze it at CERN, despite
having buildings full of computers. We
had to start distributing the petabytes of data to
our collaborating partners in order to
employ local computing and storage at
hundreds of different institutes. In
order to orchestrate these interconnected resources with
their diverse technologies, we
developed a computing grid, enabling
the seamless sharing of computing
resources around the globe. This relies
on trust relationships and mutual exchange. But
this grid model could not be transferred out
of our community so easily, where not
everyone has resources to share nor
could companies be expected to have the
same level of trust. Instead, an
alternative, more business-like approach for
accessing on-demand resources has been
flourishing recently, called cloud
computing, which other communities are
now exploiting to analyzing their big data. It
might seem paradoxical for a place like CERN, a
lab focused on the study of the
unimaginably small building blocks of matter, to
be the source of something as big as big data. But
the way we study the fundamental particles, as
well as the forces by which they interact, involves
creating them fleetingly, colliding
protons in our accelerators and
capturing a trace of them as they zoom
off near light speed.
To see those traces,our
detector, with 150 million sensors, acts
like a really massive 3-D camera, taking
a picture of each collision event - that’s
up to 14 millions times per second. That
makes a lot of data. But if big data
has been around for so long, why do we
suddenly keep hearing about it now? Well,
as the old metaphor explains, the whole
is greater than the sum of its parts, and
this is no longer just science that is exploiting this. The
fact that we can derive more knowledge by
joining related information together and
spotting correlations can inform and
enrich numerous aspects of everyday life, either
in real time, such as traffic or
financial conditions, in short-term
evolutions, such as medical or
meteorological, or in predictive
situations, such as business, crime, or
disease trends.
Comments
Post a Comment