Amazon cover image
Image from Amazon.com

Introducing data science: big data, machine learning, and more, using Python tools

By: Contributor(s): Material type: TextTextPublication details: Dreamtech Publisher New Delhi 2023Description: xx, 300 pISBN:
  • 9789351199373
Subject(s): DDC classification:
  • 006.312 CIE
Summary: Introducing Data Science explains vital data science concepts and teaches you how to accomplish the fundamental tasks that occupy data scientists. You’ll explore data visualization, graph databases, the use of NoSQL, and the data science process. You’ll use the Python language and common Python libraries as you experience firsthand the challenges of dealing with data at scale. Discover how Python allows you to gain insights from data sets so big that they need to be stored on multiple machines, or from data moving so quickly that no single machine can handle it.
List(s) this item appears in: Marketing
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Collection Call number Copy number Status Date due Barcode
Book Book Indian Institute of Management LRC General Stacks IT & Decisions Sciences 006.312 CIE (Browse shelf(Opens below)) 1 Available 004863
Browsing Indian Institute of Management LRC shelves, Shelving location: General Stacks, Collection: IT & Decisions Sciences Close shelf browser (Hides shelf browser)
006.312 BIF Machine learning for data streams: 006.312 BRA Principles of data mining 006.312 CHA Research analytics: 006.312 CIE Introducing data science: 006.312 EAG Reality mining: using big data to engineer a better world 006.312 GRI Text as data: 006.312 HOV Text analysis in python for social scientist:

1. Data science in a Big Data world
1.1. Benefits and uses of data science and Big Data
1.2. Facets of data
1.3. The data science process
1.4. The Big Data ecosystem and data science
1.5. An introductory working example of Hadoop
1.6. Summary

2. The data science process
2.1. Overview of the data science process
2.2. Step 1: defining research goals and creating a project charter
2.3. Step 2: retrieving data
2.4. Step 3: cleansing, integrating, and transforming data
2.5. Step 4: exploratory data analysis
2.6. Step 5: Build the models
2.7. Step 6: Presenting findings and building applications on top of them
2.8. Summary

3. Machine learning
3.1. What is machine learning and why should you care about it?
3.2. The modelling process
3.3. Types of machine learning
3.4. Semi-supervised learning
3.5. Summary

4. Handling large data on a single computer
4.1. The problems you face when handling large data
4.2. General techniques for handling large volumes of data
4.3. General programming tips for dealing with large datasets
4.4. Case study 1: predicting malicious URLs
4.5. Case study 2: building a recommender system inside a database
4.6. Summary

5. First steps in Big Data
5.1. Distributing data storage and processing with frameworks
5.2. Case study: assessing risk when loaning money
5.3. Summary

6. Join the NoSQL movement
6.1. Introduction to NoSQL
6.2. Case study: what disease is that?
6.3. Summary

7. The rise of graph databases
7.1. Introducing connected data and graph databases
7.2. Introducing Neo4j: a graph database
7.3. Connected data example: a recipe recommendation engine
7.4. Summary

8. Text mining and text analytics
8.1. Text mining in the real world
8.2. Text mining techniques
8.3. Case study: classifying Reddit posts
8.4. Summary


9. Data visualization to the end user

Introducing Data Science explains vital data science concepts and teaches you how to accomplish the fundamental tasks that occupy data scientists. You’ll explore data visualization, graph databases, the use of NoSQL, and the data science process. You’ll use the Python language and common Python libraries as you experience firsthand the challenges of dealing with data at scale. Discover how Python allows you to gain insights from data sets so big that they need to be stored on multiple machines, or from data moving so quickly that no single machine can handle it.

There are no comments on this title.

to post a comment.

©2019-2020 Learning Resource Centre, Indian Institute of Management Bodhgaya

Powered by Koha