Cover image for The data wrangler's handbook : simple tools for powerful results
The data wrangler's handbook : simple tools for powerful results
Title:
The data wrangler's handbook : simple tools for powerful results
ISBN:
9780838919095
Publication:
Chicago : ALA Neal-Schuman, 2019.
Physical Description:
xx, 164 pages : illustrations ; 23 cm
General Note:
Includes index.
Contents:
Getting started with the command line -- Command line concepts -- Understanding formats / by David Forero -- Simplify complicated problems -- Delimited text -- XML -- JSON (JavaScript Object Notation) -- Scripting -- Solving common problems -- Conclusions.
Abstract:
"Like all organizations, libraries are generating more data than ever before and are keen to use it. Data manipulation and analysis is far easier than most people imagine. This book demystifies the process of working with data, familiarizing readers with a small number of simple tools, and easily digestible but powerful concepts. Using tools that come with desktop computers, readers will learn to extract, manipulate, and analyze data (and metadata) of any size and complexity. Kyle Banerjee, experienced author of in data and digital library topics, is determined to take the fear out of the command line. This book will be useful to librarians developing their skills, introducing concepts and tools gradually. Starter topics, most of which can be accomplished with a single-word command, will include: how to use the output of one program as input for another, redirecting the results of that to any file or program, sorting files of any size by any criteria, identifying duplicates, listing the number of occurrences for each entry. As readers develop a firm grasp of the fundamentals, they will learn progressively more sophisticated tasks such as comparing files, converting data from one format to another, reformatting values (e.g. converting inconsistent dates to a consistent format), combining data from multiple files, and communicating with APIs (Application Programming Interfaces) built into their systems. Each chapter with more examples that power users might appreciate, but others can skip over without impeding their ability to understand anything else in the book." -- Provided by publisher.
Other Format:
Online version: Banerjee, Kyle, The data wrangler's handbook Chicago : American Library Association, 2019. 9780838919101 (DLC) 2019024259