DuckDB has quickly become one of the most practical tools for modern data analytics. Lightweight, fast, and simple to set up, it allows you to query massive datasets directly from files without the need for servers or expensive warehouses. Whether you are working with CSV, Parquet, or JSON, DuckDB makes analytics more efficient and more accessible.
This book is a complete guide for analysts, data engineers, and developers who want to get the most out of DuckDB. It takes you step by step from installation and SQL fundamentals to advanced performance tuning and real-world projects. Instead of theory, it focuses on workflows you can apply immediately in Python, R, or directly from the command line.
Inside you will learn:
• How to install and use DuckDB across Windows, macOS, Linux, Python, and R
• How to query raw CSV, Parquet, and JSON files directly without staging data
• Techniques for building ETL pipelines with DuckDB as the transformation layer
• Methods for joining and consolidating datasets across multiple file formats
• Integration with Pandas, Polars, and R dataframes for analytics and data science
• Preparing training datasets for machine learning with Scikit-Learn and PyTorch
• Performance optimizations including vectorization, caching, and parallel execution
• Using DuckDB with BI tools, dashboards, and embedded applications
• Strategies for cloud integration with S3, GCS, and Azure
• Hands-on projects covering sales analytics, ETL automation, predictive modeling, benchmarking, and full pipeline design
Every chapter combines clear explanations with code examples and end-of-section projects so you can reinforce your learning with practical exercises. SQL examples are paired with Python and R code, giving you the flexibility to adapt the workflows to your own environment.
You will see how DuckDB compares to SQLite, Pandas, PostgreSQL, and cloud warehouses, and where it fits best in the modern data stack. You will also explore advanced topics like time-series optimization, partitioned data strategies, community extensions, and future ecosystem developments with MotherDuck and cloud services.
Whether you are a beginner learning SQL, a data scientist preparing machine learning datasets, or a developer embedding analytics into applications, this book gives you the knowledge and confidence to put DuckDB to work effectively.
Analytics should be fast, accessible, and free of unnecessary infrastructure. DuckDB makes this possible, and this guide shows you how to master it with practical, real-world workflows.
"synopsis" may belong to another edition of this title.
Seller: Grand Eagle Retail, Bensenville, IL, U.S.A.
Paperback. Condition: new. Paperback. DuckDB has quickly become one of the most practical tools for modern data analytics. Lightweight, fast, and simple to set up, it allows you to query massive datasets directly from files without the need for servers or expensive warehouses. Whether you are working with CSV, Parquet, or JSON, DuckDB makes analytics more efficient and more accessible.This book is a complete guide for analysts, data engineers, and developers who want to get the most out of DuckDB. It takes you step by step from installation and SQL fundamentals to advanced performance tuning and real-world projects. Instead of theory, it focuses on workflows you can apply immediately in Python, R, or directly from the command line.Inside you will learn: - How to install and use DuckDB across Windows, macOS, Linux, Python, and R- How to query raw CSV, Parquet, and JSON files directly without staging data- Techniques for building ETL pipelines with DuckDB as the transformation layer- Methods for joining and consolidating datasets across multiple file formats- Integration with Pandas, Polars, and R dataframes for analytics and data science- Preparing training datasets for machine learning with Scikit-Learn and PyTorch- Performance optimizations including vectorization, caching, and parallel execution- Using DuckDB with BI tools, dashboards, and embedded applications- Strategies for cloud integration with S3, GCS, and Azure- Hands-on projects covering sales analytics, ETL automation, predictive modeling, benchmarking, and full pipeline designEvery chapter combines clear explanations with code examples and end-of-section projects so you can reinforce your learning with practical exercises. SQL examples are paired with Python and R code, giving you the flexibility to adapt the workflows to your own environment.You will see how DuckDB compares to SQLite, Pandas, PostgreSQL, and cloud warehouses, and where it fits best in the modern data stack. You will also explore advanced topics like time-series optimization, partitioned data strategies, community extensions, and future ecosystem developments with MotherDuck and cloud services.Whether you are a beginner learning SQL, a data scientist preparing machine learning datasets, or a developer embedding analytics into applications, this book gives you the knowledge and confidence to put DuckDB to work effectively.Analytics should be fast, accessible, and free of unnecessary infrastructure. DuckDB makes this possible, and this guide shows you how to master it with practical, real-world workflows. This item is printed on demand. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Seller Inventory # 9798265720399
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: New. Seller Inventory # 51519598-n
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: As New. Unread book in perfect condition. Seller Inventory # 51519598
Seller: PBShop.store UK, Fairford, GLOS, United Kingdom
PAP. Condition: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # L0-9798265720399
Quantity: Over 20 available
Seller: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: New. Seller Inventory # 51519598-n
Quantity: Over 20 available
Seller: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: As New. Unread book in perfect condition. Seller Inventory # 51519598
Quantity: Over 20 available
Seller: CitiRetail, Stevenage, United Kingdom
Paperback. Condition: new. Paperback. DuckDB has quickly become one of the most practical tools for modern data analytics. Lightweight, fast, and simple to set up, it allows you to query massive datasets directly from files without the need for servers or expensive warehouses. Whether you are working with CSV, Parquet, or JSON, DuckDB makes analytics more efficient and more accessible.This book is a complete guide for analysts, data engineers, and developers who want to get the most out of DuckDB. It takes you step by step from installation and SQL fundamentals to advanced performance tuning and real-world projects. Instead of theory, it focuses on workflows you can apply immediately in Python, R, or directly from the command line.Inside you will learn: - How to install and use DuckDB across Windows, macOS, Linux, Python, and R- How to query raw CSV, Parquet, and JSON files directly without staging data- Techniques for building ETL pipelines with DuckDB as the transformation layer- Methods for joining and consolidating datasets across multiple file formats- Integration with Pandas, Polars, and R dataframes for analytics and data science- Preparing training datasets for machine learning with Scikit-Learn and PyTorch- Performance optimizations including vectorization, caching, and parallel execution- Using DuckDB with BI tools, dashboards, and embedded applications- Strategies for cloud integration with S3, GCS, and Azure- Hands-on projects covering sales analytics, ETL automation, predictive modeling, benchmarking, and full pipeline designEvery chapter combines clear explanations with code examples and end-of-section projects so you can reinforce your learning with practical exercises. SQL examples are paired with Python and R code, giving you the flexibility to adapt the workflows to your own environment.You will see how DuckDB compares to SQLite, Pandas, PostgreSQL, and cloud warehouses, and where it fits best in the modern data stack. You will also explore advanced topics like time-series optimization, partitioned data strategies, community extensions, and future ecosystem developments with MotherDuck and cloud services.Whether you are a beginner learning SQL, a data scientist preparing machine learning datasets, or a developer embedding analytics into applications, this book gives you the knowledge and confidence to put DuckDB to work effectively.Analytics should be fast, accessible, and free of unnecessary infrastructure. DuckDB makes this possible, and this guide shows you how to master it with practical, real-world workflows. This item is printed on demand. Shipping may be from our UK warehouse or from our Australian or US warehouses, depending on stock availability. Seller Inventory # 9798265720399
Quantity: 1 available