
Introducing data science: big data, machine learning, and more, using Python tools

By: Cielen, Davy
Contributor(s): Meysman, Arno D.B | Ali, Mohamed
Material type: Text
Publication details: New Delhi: Dreamtech Publisher, 2023
Description: xx, 300 p
ISBN: 9789351199373
Subject(s): Machine learning | Python (Computer program language) | Big data | Data mining
DDC classification: 006.312
Summary: Introducing Data Science explains vital data science concepts and teaches you how to accomplish the fundamental tasks that occupy data scientists. You’ll explore data visualization, graph databases, the use of NoSQL, and the data science process. You’ll use the Python language and common Python libraries as you experience firsthand the challenges of dealing with data at scale. Discover how Python allows you to gain insights from data sets so big that they need to be stored on multiple machines, or from data moving so quickly that no single machine can handle it.
List(s) this item appears in: Marketing
Holdings
Item type: Book
Current library: Indian Institute of Management LRC, General Stacks
Collection: IT & Decisions Sciences
Call number: 006.312 CIE
Copy number: 1
Status: Available
Date due: (none listed)
Barcode: 004863

1. Data science in a Big Data world
1.1. Benefits and uses of data science and Big Data
1.2. Facets of data
1.3. The data science process
1.4. The Big Data ecosystem and data science
1.5. An introductory working example of Hadoop
1.6. Summary

2. The data science process
2.1. Overview of the data science process
2.2. Step 1: defining research goals and creating a project charter
2.3. Step 2: retrieving data
2.4. Step 3: cleansing, integrating, and transforming data
2.5. Step 4: exploratory data analysis
2.6. Step 5: Build the models
2.7. Step 6: Presenting findings and building applications on top of them
2.8. Summary

3. Machine learning
3.1. What is machine learning and why should you care about it?
3.2. The modelling process
3.3. Types of machine learning
3.4. Semi-supervised learning
3.5. Summary

4. Handling large data on a single computer
4.1. The problems you face when handling large data
4.2. General techniques for handling large volumes of data
4.3. General programming tips for dealing with large datasets
4.4. Case study 1: predicting malicious URLs
4.5. Case study 2: building a recommender system inside a database
4.6. Summary

5. First steps in Big Data
5.1. Distributing data storage and processing with frameworks
5.2. Case study: assessing risk when loaning money
5.3. Summary

6. Join the NoSQL movement
6.1. Introduction to NoSQL
6.2. Case study: what disease is that?
6.3. Summary

7. The rise of graph databases
7.1. Introducing connected data and graph databases
7.2. Introducing Neo4j: a graph database
7.3. Connected data example: a recipe recommendation engine
7.4. Summary

8. Text mining and text analytics
8.1. Text mining in the real world
8.2. Text mining techniques
8.3. Case study: classifying Reddit posts
8.4. Summary


9. Data visualization to the end user



©2019-2020 Learning Resource Centre, Indian Institute of Management Bodhgaya
