SQL for data science (Record no. 64119)

MARC details
000 -LEADER
fixed length control field 03782dam a2200325Ii 4500
001 - CONTROL NUMBER
control field 0000372243
003 - CONTROL NUMBER IDENTIFIER
control field 0001
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 221213s2020 sz# ob 001 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9783030575915 (paperback)
Qualifying information (electronic bk.)
International Standard Book Number 3030575926
Qualifying information (electronic bk.)
Canceled/invalid ISBN 3030575918
Canceled/invalid ISBN 9783030575915
035 ## - SYSTEM CONTROL NUMBER
System control number (OCoLC)1205607630
Canceled/invalid control number (OCoLC)1225563876
-- (OCoLC)1227334462
040 ## - CATALOGING SOURCE
Original cataloging agency YDX
Language of cataloging eng
Description conventions rda
Transcribing agency YDX
Modifying agency SFB
-- UAB
-- OCLCF
-- EBLCP
-- GW5XE
-- OCLCO
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 005.74
Edition number 23
084 ## - OTHER CLASSIFICATION NUMBER
Classification number 005.74
Item number BAD-S
Number source 23
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Badia, Antonio,
Authority record control number or standard number http://id.loc.gov/authorities/names/nb2009013244
Relator term author
245 10 - TITLE STATEMENT
Title SQL for data science
Medium Book :
Remainder of title data cleaning, wrangling and analytics with relational databases /
Statement of responsibility, etc. Antonio Badia.
300 ## - PHYSICAL DESCRIPTION
Extent xi, 285 pages :
Dimensions 23 cm.
365 ## - TRADE PRICE
Price type code 01
Price amount 11,347.18
440 #0 - SERIES STATEMENT/ADDED ENTRY--TITLE
Title Data-centric systems and applications.
Authority record control number http://id.loc.gov/authorities/names/no2003128521
International Standard Serial Number 2197-9723
490 1# - SERIES STATEMENT
Series statement Data-centric systems and applications,
International Standard Serial Number 2197-9723
500 ## - GENERAL NOTE
General note Includes references and index
520 ## - SUMMARY, ETC.
Summary, etc. This textbook explains SQL within the context of data science and introduces the different parts of SQL as they are needed for the tasks usually carried out during data analysis. Using the framework of the data life cycle, it focuses on the steps that are very often given the short shift in traditional textbooks, like data loading, cleaning and pre-processing. The book is organized as follows. Chapter 1 describes the data life cycle, i.e. the sequence of stages from data acquisition to archiving, that data goes through as it is prepared and then actually analyzed, together with the different activities that take place at each stage. Chapter 2 gets into databases proper, explaining how relational databases organize data. Non-traditional data, like XML and text, are also covered. Chapter 3 introduces SQL queries, but unlike traditional textbooks, queries and their parts are described around typical data analysis tasks like data exploration, cleaning and transformation. Chapter 4 introduces some basic techniques for data analysis and shows how SQL can be used for some simple analyses without too much complication. Chapter 5 introduces additional SQL constructs that are important in a variety of situations and thus completes the coverage of SQL queries. Lastly, chapter 6 briefly explains how to use SQL from within R and from within Python programs. It focuses on how these languages can interact with a database, and how what has been learned about SQL can be leveraged to make life easier when using R or Python. All chapters contain a lot of examples and exercises on the way, and readers are encouraged to install the two open-source database systems (MySQL and Postgres) that are used throughout the book in order to practice and work on the exercises, because simply reading the book is much less useful than actually using it. This book is for anyone interested in data science and/or databases. It just demands a bit of computer fluency, but no specific background on databases or data analysis. All concepts are introduced intuitively and with a minimum of specialized jargon. After going through this book, readers should be able to profitably learn more about data mining, machine learning, and database management from more advanced textbooks and courses
521 ## - TARGET AUDIENCE NOTE
Target audience note All.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Database management.
Authority record control number or standard number http://id.loc.gov/authorities/subjects/sh85035848
Topical term or geographic name entry element Big data.
Authority record control number or standard number http://id.loc.gov/authorities/subjects/sh2012003227
Topical term or geographic name entry element SQL (Computer program language)
Authority record control number or standard number http://id.loc.gov/authorities/subjects/sh86006628
852 ## - LOCATION
Accession No. 10001000062606
-- 11347.18
-- Allied Book Company
Former shelving location Books
Holdings
Withdrawn status Lost status Damaged status Not for loan Home library Current library Date acquired Total checkouts Total renewals Full call number Barcode Date last seen Date last checked out Price effective from Koha item type
        Junaid Zaidi Library, COMSATS University Islamabad Junaid Zaidi Library, COMSATS University Islamabad 09/09/2023 1 1 005.74 BAD-S 62606 10001000062606 01/13/2025 11/14/2024 09/09/2023 Books