更新时间:2021-06-11 18:35:51
封面
版权信息
Preface
1. Data Storage Fundamentals
Introduction
Problems Solved by Machine Learning
Optimizing the Storing and Processing of Data for Machine Learning Problems
Diving into Text Classification
Looking at Terminology in Text Classification Tasks
Designing for Scale – Choosing the Right Architecture and Hardware
Using Vectorized Operations to Analyze Data Fast
Summary
2. Artificial Intelligence Storage Requirements
Storage Requirements
Data Layers
Raw Data
Historical Data
Streaming Data
Analytics Data
Model Development and Training
3. Data Preparation
ETL
Data Processing Techniques
4. The Ethics of AI Data Storage
5. Data Stores: SQL and NoSQL Databases
Database Components
SQL Databases
MySQL
NoSQL Databases
MongoDB
Cassandra
Exploring the Collective Knowledge of Databases
6. Big Data File Formats
Common Input Files
Choosing the Right Format for Your Data
Introduction to File Formats
7. Introduction to Analytics Engine (Spark) for Big Data
Apache Spark
Apache Spark and Databricks
Understanding Various Spark Transformations
Understanding Various Spark Actions
Best Practices
8. Data System Design Examples
The Importance of System Design
Components to Consider in System Design
Examining a Pipeline Design for an AI System
Making a Pipeline System Highly Available
9. Workflow Management for AI
Creating Your Data Pipeline
Challenges in Managing Processes in the Real World
Automating a Data Pipeline
Automating Asynchronous Data Pipelines
Workflow Management with Airflow
10. Introduction to Data Storage on Cloud Services (AWS)
Interacting with Cloud Storage
Getting Started with Cloud Relational Databases
Introduction to NoSQL Data Stores on the Cloud
Data in Document Format
11. Building an Artificial Intelligence Algorithm
Machine Learning Algorithms
Model Training
Gradient Descent
Getting Started with PyTorch
Mini-Batch SGD with PyTorch
12. Productionizing Your AI Applications
pickle and Flask
Deploying Models to Production
Model Execution in Streaming Data Applications
Appendix
4. Ethics of AI Data Storage