Hadoop For Dummies

De (autor)
Notă GoodReads:
en Limba Engleză Paperback – 07 May 2014
Let Hadoop For Dummies help harness the power of your data and rein in the information overload Big data has become big business, and companies and organizations of all sizes are struggling to find ways to retrieve valuable information from their massive data sets with becoming overwhelmed. Enter Hadoop and this easy-to-understand For Dummies guide. Hadoop For Dummies helps readers understand the value of big data, make a business case for using Hadoop, navigate the Hadoop ecosystem, and build and manage Hadoop applications and clusters. * Explains the origins of Hadoop, its economic benefits, and its functionality and practical applications * Helps you find your way around the Hadoop ecosystem, program MapReduce, utilize design patterns, and get your Hadoop cluster up and running quickly and easily * Details how to use Hadoop applications for data mining, web analytics and personalization, large-scale text processing, data science, and problem-solving * Shows you how to improve the value of your Hadoop cluster, maximize your investment in Hadoop, and avoid common pitfalls when building your Hadoop cluster From programmers challenged with building and maintaining affordable, scaleable data systems to administrators who must deal with huge volumes of information effectively and efficiently, this how-to has something to help you with Hadoop.
Citește tot Restrânge

Preț: 12090 lei

Preț vechi: 15113 lei

Puncte Express: 181

Preț estimativ în valută:
2357 2570$ 2074£

Carte disponibilă

Livrare economică 17-28 februarie
Livrare express 04-10 februarie pentru 2704 lei

Preluare comenzi: 021 569.72.76


ISBN-13: 9781118607558
ISBN-10: 1118607554
Pagini: 408
Ilustrații: black & white illustrations, black & white tables, figures
Dimensiuni: 189 x 235 x 22 mm
Greutate: 0.54 kg
Editura: Wiley
Locul publicării: Hoboken, United States

Public țintă

Primary: IT managers, IT workers, data scientists  Secondary: Business managers and decision makers.


Introduction 1 Part I: Getting Started with Hadoop 7 Chapter 1: Introducing Hadoop and Seeing What It's Good For 9 Chapter 2: Common Use Cases for Big Data in Hadoop 23 Chapter 3: Setting Up Your Hadoop Environment 41 Part II: How Hadoop Works 51 Chapter 4: Storing Data in Hadoop: The Hadoop Distributed File System 53 Chapter 5: Reading and Writing Data 69 Chapter 6: MapReduce Programming 83 Chapter 7: Frameworks for Processing Data in Hadoop: YARN and MapReduce 103 Chapter 8: Pig: Hadoop Programming Made Easier 117 Chapter 9: Statistical Analysis in Hadoop 129 Chapter 10: Developing and Scheduling Application Workflows with Oozie 139 Part III: Hadoop and Structured Data 155 Chapter 11: Hadoop and the Data Warehouse: Friends or Foes? 157 Chapter 12: Extremely Big Tables: Storing Data in HBase 179 Chapter 13: Applying Structure to Hadoop Data with Hive 227 Chapter 14: Integrating Hadoop with Relational Databases Using Sqoop 269 Chapter 15: The Holy Grail: Native SQL Access to Hadoop Data 303 Part IV: Administering and Configuring Hadoop 313 Chapter 16: Deploying Hadoop 315 Chapter 17: Administering Your Hadoop Cluster 335 Part V: The Part of Tens 359 Chapter 18: Ten Hadoop Resources Worthy of a Bookmark 361 Chapter 19: Ten Reasons to Adopt Hadoop 371 Index 379

Notă biografică

Dirk deRoos is the technical sales lead for IBM's InfoSphere BigInsights. Paul C. Zikopoulos is the vice president of big data in the IBM Information Management division. Roman B. Melnyk, PhD is a senior member of the DB2 Information Development team. Bruce Brown and Rafael Coss work with big data with IBM.