Introduction to Data Science: Data Analysis and Prediction Algorithms with R introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling, data visualization, predictive algorithm building, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation.
This book is a textbook for a first course in data science. No previous knowledge of R is necessary, although some experience with programming may be helpful. The book is divided into six parts: R, data visualization, statistics with R, data wrangling, machine learning, and productivity tools. Each part has several chapters meant to be presented as one lecture.
The author uses motivating case studies that realistically mimic a data scientist’s experience. He starts by asking specific questions and answers these through data analysis so concepts are learned as a means to answering the questions. Examples of the case studies included are: US murder rates by state, self-reported student heights, trends in world health and economics, the impact of vaccines on infectious disease rates, the financial crisis of 2007-2008, election forecasting, building a baseball team, image processing of hand-written digits, and movie recommendation systems.
The statistical concepts used to answer the case study questions are only briefly introduced, so complementing with a probability and statistics textbook is highly recommended for in-depth understanding of these concepts. If you read and understand the chapters and complete the exercises, you will be prepared to learn the more advanced concepts and skills needed to become an expert.
"synopsis" may belong to another edition of this title.
Rafael A. Irizarry is professor of data sciences at the Dana-Farber Cancer Institute, professor of biostatistics at Harvard, and a fellow of the American Statistical Association. Dr. Irizarry is an applied statistician and during the last 20 years has worked in diverse areas, including genomics, sound engineering, and public health. He disseminates solutions to data analysis challenges as open source software, tools that are widely downloaded and used. Prof. Irizarry has also developed and taught several data science courses at Harvard as well as popular online courses.
"I think the book would be perfect for schools looking to make a transition to a model where introduction to data science takes the place of introduction to statistics and maybe introductory computer science." ~Arend Kuyper, Northwestern University
"About this title" may belong to another edition of this title.
Shipping:
US$ 3.25
Within U.S.A.
Seller: Indiana Book Company, Marion, IN, U.S.A.
Condition: Acceptable. The spine/binding has been reinforced with book tape or binding glue. All pages intact. Ships same or next business day with delivery confirmation. Acceptable condition. Contains highlighting. Expedited shipping available. Seller Inventory # 1000009448940-4078
Quantity: 1 available
Seller: Textbooks_Source, Columbia, MO, U.S.A.
hardcover. Condition: Good. 1st Edition. Ships in a BOX from Central Missouri! May not include working access code. Will not include dust jacket. Has used sticker(s) and some writing or highlighting. UPS shipping for most packages, (Priority Mail for AK/HI/APO/PO Boxes). Seller Inventory # 002326249U
Quantity: 2 available
Seller: Upward Bound Books, VALRICO, FL, U.S.A.
Condition: Acceptable. Books may exhibit damage including dents, creases, and folded pages. Some volumes may contain annotations or highlighted sections. PLEASE NOTE that extras or accessories may not be included. Additionally, digital codes and CDs have not been verified for functionality and may be inoperative. Seller Inventory # 59WS4H001SC3_ns
Quantity: 1 available
Seller: TextbookRush, Grandview Heights, OH, U.S.A.
Condition: Good. Ships SAME or NEXT business day. We Ship to APO/FPO addr. Choose EXPEDITED shipping and receive in 2-5 business days within the United States. See our member profile for customer support contact info. We have an easy return policy. Seller Inventory # 53509578
Quantity: 1 available
Seller: Romtrade Corp., STERLING HEIGHTS, MI, U.S.A.
Condition: New. This is a Brand-new US Edition. This Item may be shipped from US or any other country as we have multiple locations worldwide. Seller Inventory # ABTA-7909
Quantity: 1 available
Seller: Textbooks_Source, Columbia, MO, U.S.A.
hardcover. Condition: New. 1st Edition. Ships in a BOX from Central Missouri! UPS shipping for most packages, (Priority Mail for AK/HI/APO/PO Boxes). Seller Inventory # 002326249N
Quantity: 13 available
Seller: Better World Books, Mishawaka, IN, U.S.A.
Condition: Very Good. Used book that is in excellent condition. May show signs of wear or have minor defects. Seller Inventory # 51913033-6
Quantity: 1 available
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: New. Seller Inventory # 35896907-n
Quantity: 10 available
Seller: Grand Eagle Retail, Fairfield, OH, U.S.A.
Hardcover. Condition: new. Hardcover. Covers the basics of R and the tidyverse Demonstrate how to use ggplot2 to generate graphs and describe important Data Visualization principlesIntroduces Data Wranglin topics such as web scrapping, using regular expressions, and joining and reshaping data tables using the tidyverse tools Illustrates the importance of statistics in data analysis using case studies Uses the caret package to build prediction algorithms including K-nearest Neighbors and Random ForestsIncludes tools used on a day-to-day basis in data science projects including RStudio, UNIX/Linux shell, Git and GitHub, and knitr and R Markdown The book begins by going over the basics of R and the tidyverse. You learn R throughout the book, but in the first part we go over the building blocks needed to keep learning during the rest of the book. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Seller Inventory # 9780367357986
Quantity: 1 available
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: As New. Unread book in perfect condition. Seller Inventory # 35896907
Quantity: 10 available