data analytics case studies: Humanities Data Analysis Folgert Karsdorp, Mike Kestemont, Allen Riddell, 2021-01-12 A practical guide to data-intensive humanities research using the Python programming language The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment. The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter. An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions. Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python Applicable to many humanities disciplines, including history, literature, and sociology Offers real-world case studies using publicly available data sets Provides exercises at the end of each chapter for students to test acquired skills Emphasizes visual storytelling via data visualizations |
data analytics case studies: Case Studies in Neural Data Analysis Mark A. Kramer, Uri T. Eden, 2016-11-04 A practical guide to neural data analysis techniques that presents sample datasets and hands-on methods for analyzing the data. As neural data becomes increasingly complex, neuroscientists now require skills in computer programming, statistics, and data analysis. This book teaches practical neural data analysis techniques by presenting example datasets and developing techniques and tools for analyzing them. Each chapter begins with a specific example of neural data, which motivates mathematical and statistical analysis methods that are then applied to the data. This practical, hands-on approach is unique among data analysis textbooks and guides, and equips the reader with the tools necessary for real-world neural data analysis. The book begins with an introduction to MATLAB, the most common programming platform in neuroscience, which is used in the book. (Readers familiar with MATLAB can skip this chapter and might decide to focus on data type or method type.) The book goes on to cover neural field data and spike train data, spectral analysis, generalized linear models, coherence, and cross-frequency coupling. Each chapter offers a stand-alone case study that can be used separately as part of a targeted investigation. The book includes some mathematical discussion but does not focus on mathematical or statistical theory, emphasizing the practical instead. References are included for readers who want to explore the theoretical more deeply. The data and accompanying MATLAB code are freely available on the authors' website. The book can be used for upper-level undergraduate or graduate courses or as a professional reference. A version of this textbook with all of the examples in Python is available on the MIT Press website. |
data analytics case studies: Data Analysis for Business, Economics, and Policy Gábor Békés, Gábor Kézdi, 2021-05-06 A comprehensive textbook on data analysis for business, applied economics and public policy that uses case studies with real-world data. |
data analytics case studies: Fundamentals of Machine Learning for Predictive Data Analytics, second edition John D. Kelleher, Brian Mac Namee, Aoife D'Arcy, 2020-10-20 The second edition of a comprehensive introduction to machine learning approaches used in predictive data analytics, covering both theory and practice. Machine learning is often used to build predictive models by extracting patterns from large datasets. These models are used in predictive data analytics applications including price prediction, risk assessment, predicting customer behavior, and document classification. This introductory textbook offers a detailed and focused treatment of the most important machine learning approaches used in predictive data analytics, covering both theoretical concepts and practical applications. Technical and mathematical material is augmented with explanatory worked examples, and case studies illustrate the application of these models in the broader business context. This second edition covers recent developments in machine learning, especially in a new chapter on deep learning, and two new chapters that go beyond predictive analytics to cover unsupervised learning and reinforcement learning. |
data analytics case studies: Practical Data Analysis Peter G. Bryant, Marlene A. Smith, 1998-11 Practical Data Analysis: Case Studies in Business Statistics is a collection of 75 class tested case studies for use in introductory business statistics and general statistics. All cases are drawn from real situations in a broad range of business, economic, and social science settings and include small and large data sets for analysis by students. The philosophy behind the package is to let the cases and data drive or supplement the course. Doing so provides three important opportunities for students and instructors: useful computing experience, hands-on activity, which is more motivating than the traditional course format, and a sense of realism about the use of statistics. |
data analytics case studies: Case Studies in Applied Bayesian Data Science Kerrie L. Mengersen, Pierre Pudlo, Christian P. Robert, 2020-05-28 Presenting a range of substantive applied problems within Bayesian Statistics along with their Bayesian solutions, this book arises from a research program at CIRM in France in the second semester of 2018, which supported Kerrie Mengersen as a visiting Jean-Morlet Chair and Pierre Pudlo as the local Research Professor. The field of Bayesian statistics has exploded over the past thirty years and is now an established field of research in mathematical statistics and computer science, a key component of data science, and an underpinning methodology in many domains of science, business and social science. Moreover, while remaining naturally entwined, the three arms of Bayesian statistics, namely modelling, computation and inference, have grown into independent research fields. While the research arms of Bayesian statistics continue to grow in many directions, they are harnessed when attention turns to solving substantive applied problems. Each such problem set has its own challenges and hence draws from the suite of research a bespoke solution. The book will be useful for both theoretical and applied statisticians, as well as practitioners, to inspect these solutions in the context of the problems, in order to draw further understanding, awareness and inspiration. |
data analytics case studies: Thinking with Data Max Shron, 2014-01-20 Many analysts are too concerned with tools and techniques for cleansing, modeling, and visualizing datasets and not concerned enough with asking the right questions. In this practical guide, data strategy consultant Max Shron shows you how to put the why before the how, through an often-overlooked set of analytical skills. Thinking with Data helps you learn techniques for turning data into knowledge you can use. You’ll learn a framework for defining your project, including the data you want to collect, and how you intend to approach, organize, and analyze the results. You’ll also learn patterns of reasoning that will help you unveil the real problem that needs to be solved. Learn a framework for scoping data projects Understand how to pin down the details of an idea, receive feedback, and begin prototyping Use the tools of arguments to ask good questions, build projects in stages, and communicate results Explore data-specific patterns of reasoning and learn how to build more useful arguments Delve into causal reasoning and learn how it permeates data work Put everything together, using extended examples to see the method of full problem thinking in action |
data analytics case studies: Case Studies in Data Analysis Jane F. Gentleman, G.A. Whitmore, 2012-12-06 This volume is a collection of eight Case Studies in Data Analysis that appeared in various issues of the Canadian Journal of Statistics (OS) over a twelve year period from 1982 to 1993. One follow-up article to Case Study No.4 is also included in the volume. The OS's Section on Case Studies in Data Analysis was initiated by a former editor who wanted to increase the analytical content of the journal. We were asked to become Section Co-Editors and to develop a format for the case studies. Each case study presents analyses of a real data set by two or more analysts or teams of analysts working independently in a simulated consulting context. The section aimed at demonstrating the process of statistical analysis and the possible diversity of approaches and conclusions. For each case study, the Co-Editors found a set of real Canadian data, posed what they thought was an interesting statistical problem, and recruited analysts working in Canada who were willing to tackle it. The published case studies describe the data and the problem, and present and discuss the analysts' solutions. For some case studies, the providers of the data were invited to contribute their own analysis. |
data analytics case studies: Data Analytics for Pandemics Gitanjali Rahul Shinde, Asmita Balasaheb Kalamkar, Parikshit N. Mahalle, Nilanjan Dey, 2020-08-30 Epidemic trend analysis, timeline progression, prediction, and recommendation are critical for initiating effective public health control strategies, and AI and data analytics play an important role in epidemiology, diagnostic, and clinical fronts. The focus of this book is data analytics for COVID-19, which includes an overview of COVID-19 in terms of epidemic/pandemic, data processing and knowledge extraction. Data sources, storage and platforms are discussed along with discussions on data models, their performance, different big data techniques, tools and technologies. This book also addresses the challenges in applying analytics to pandemic scenarios, case studies and control strategies. Aimed at Data Analysts, Epidemiologists and associated researchers, this book: discusses challenges of AI model for big data analytics in pandemic scenarios; explains how different big data analytics techniques can be implemented; provides a set of recommendations to minimize infection rate of COVID-19; summarizes various techniques of data processing and knowledge extraction; enables users to understand big data analytics techniques required for prediction purposes. |
data analytics case studies: Python Machine Learning Case Studies Danish Haroon, 2017-10-27 Embrace machine learning approaches and Python to enable automatic rendering of rich insights and solve business problems. The book uses a hands-on case study-based approach to crack real-world applications to which machine learning concepts can be applied. These smarter machines will enable your business processes to achieve efficiencies on minimal time and resources. Python Machine Learning Case Studies takes you through the steps to improve business processes and determine the pivotal points that frame strategies. You’ll see machine learning techniques that you can use to support your products and services. Moreover you’ll learn the pros and cons of each of the machine learning concepts to help you decide which one best suits your needs. By taking a step-by-step approach to coding in Python you’ll be able to understand the rationale behind model selection and decisions within the machine learning process. The book is equipped with practical examples along with code snippets to ensure that you understand the data science approach to solving real-world problems. What You Will Learn Gain insights into machine learning concepts Work on real-world applications of machine learning Learn concepts of model selection and optimization Get a hands-on overview of Python from a machine learning point of view Who This Book Is For Data scientists, data analysts, artificial intelligence engineers, big data enthusiasts, computer scientists, computer sciences students, and capital market analysts. |
data analytics case studies: Data Mining with R Luis Torgo, 2016-11-30 Data Mining with R: Learning with Case Studies, Second Edition uses practical examples to illustrate the power of R and data mining. Providing an extensive update to the best-selling first edition, this new edition is divided into two parts. The first part will feature introductory material, including a new chapter that provides an introduction to data mining, to complement the already existing introduction to R. The second part includes case studies, and the new edition strongly revises the R code of the case studies making it more up-to-date with recent packages that have emerged in R. The book does not assume any prior knowledge about R. Readers who are new to R and data mining should be able to follow the case studies, and they are designed to be self-contained so the reader can start anywhere in the document. The book is accompanied by a set of freely available R source files that can be obtained at the book’s web site. These files include all the code used in the case studies, and they facilitate the do-it-yourself approach followed in the book. Designed for users of data analysis tools, as well as researchers and developers, the book should be useful for anyone interested in entering the world of R and data mining. About the Author Luís Torgo is an associate professor in the Department of Computer Science at the University of Porto in Portugal. He teaches Data Mining in R in the NYU Stern School of Business’ MS in Business Analytics program. An active researcher in machine learning and data mining for more than 20 years, Dr. Torgo is also a researcher in the Laboratory of Artificial Intelligence and Data Analysis (LIAAD) of INESC Porto LA. |
data analytics case studies: Data Management and Analysis Reda Alhajj, Mohammad Moshirpour, Behrouz Far, 2019-12-20 Data management and analysis is one of the fastest growing and most challenging areas of research and development in both academia and industry. Numerous types of applications and services have been studied and re-examined in this field resulting in this edited volume which includes chapters on effective approaches for dealing with the inherent complexity within data management and analysis. This edited volume contains practical case studies, and will appeal to students, researchers and professionals working in data management and analysis in the business, education, healthcare, and bioinformatics areas. |
data analytics case studies: Machine Learning and Data Science in the Oil and Gas Industry Patrick Bangert, 2021-03-04 Machine Learning and Data Science in the Oil and Gas Industry explains how machine learning can be specifically tailored to oil and gas use cases. Petroleum engineers will learn when to use machine learning, how it is already used in oil and gas operations, and how to manage the data stream moving forward. Practical in its approach, the book explains all aspects of a data science or machine learning project, including the managerial parts of it that are so often the cause for failure. Several real-life case studies round out the book with topics such as predictive maintenance, soft sensing, and forecasting. Viewed as a guide book, this manual will lead a practitioner through the journey of a data science project in the oil and gas industry circumventing the pitfalls and articulating the business value. - Chart an overview of the techniques and tools of machine learning including all the non-technological aspects necessary to be successful - Gain practical understanding of machine learning used in oil and gas operations through contributed case studies - Learn change management skills that will help gain confidence in pursuing the technology - Understand the workflow of a full-scale project and where machine learning benefits (and where it does not) |
data analytics case studies: Introduction to Data Science Rafael A. Irizarry, 2019-11-20 Introduction to Data Science: Data Analysis and Prediction Algorithms with R introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling, data visualization, predictive algorithm building, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation. This book is a textbook for a first course in data science. No previous knowledge of R is necessary, although some experience with programming may be helpful. The book is divided into six parts: R, data visualization, statistics with R, data wrangling, machine learning, and productivity tools. Each part has several chapters meant to be presented as one lecture. The author uses motivating case studies that realistically mimic a data scientist’s experience. He starts by asking specific questions and answers these through data analysis so concepts are learned as a means to answering the questions. Examples of the case studies included are: US murder rates by state, self-reported student heights, trends in world health and economics, the impact of vaccines on infectious disease rates, the financial crisis of 2007-2008, election forecasting, building a baseball team, image processing of hand-written digits, and movie recommendation systems. The statistical concepts used to answer the case study questions are only briefly introduced, so complementing with a probability and statistics textbook is highly recommended for in-depth understanding of these concepts. If you read and understand the chapters and complete the exercises, you will be prepared to learn the more advanced concepts and skills needed to become an expert. |
data analytics case studies: The Power of People Nigel Guenole, Jonathan Ferrar, Sheri Feinzig, 2017-05-19 Learn from Today’s Most Successful Workforce Analytics Leaders Transforming the immense potential of workforce analytics into reality isn’t easy. Pioneering practitioners have learned crucial lessons that can help you succeed. The Power of People shares their journeys—and their indispensable insights. Drawing on incisive case studies and vignettes, three experts help you bring purpose and clarity to any workforce analytics project, with robust research design and analysis to get reliable insights. They reveal where to start, where to find stakeholder support, and how to earn “quick wins” to build upon. You’ll learn how to sustain success through best-practice data management, technology usage, partnering, and skill building. Finally, you’ll discover how to earn even more value by establishing an analytical mindset throughout HR, and building two key skills: storytelling and visualization. The Power of People will be invaluable to HR executives establishing or leading analytics functions; HR professionals planning analytics projects; and any business executive who wants more value from HR. |
data analytics case studies: Build a Career in Data Science Emily Robinson, Jacqueline Nolis, 2020-03-24 Summary You are going to need more than technical knowledge to succeed as a data scientist. Build a Career in Data Science teaches you what school leaves out, from how to land your first job to the lifecycle of a data science project, and even how to become a manager. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the technology What are the keys to a data scientist’s long-term success? Blending your technical know-how with the right “soft skills” turns out to be a central ingredient of a rewarding career. About the book Build a Career in Data Science is your guide to landing your first data science job and developing into a valued senior employee. By following clear and simple instructions, you’ll learn to craft an amazing resume and ace your interviews. In this demanding, rapidly changing field, it can be challenging to keep projects on track, adapt to company needs, and manage tricky stakeholders. You’ll love the insights on how to handle expectations, deal with failures, and plan your career path in the stories from seasoned data scientists included in the book. What's inside Creating a portfolio of data science projects Assessing and negotiating an offer Leaving gracefully and moving up the ladder Interviews with professional data scientists About the reader For readers who want to begin or advance a data science career. About the author Emily Robinson is a data scientist at Warby Parker. Jacqueline Nolis is a data science consultant and mentor. Table of Contents: PART 1 - GETTING STARTED WITH DATA SCIENCE 1. What is data science? 2. Data science companies 3. Getting the skills 4. Building a portfolio PART 2 - FINDING YOUR DATA SCIENCE JOB 5. The search: Identifying the right job for you 6. The application: Résumés and cover letters 7. The interview: What to expect and how to handle it 8. The offer: Knowing what to accept PART 3 - SETTLING INTO DATA SCIENCE 9. The first months on the job 10. Making an effective analysis 11. Deploying a model into production 12. Working with stakeholders PART 4 - GROWING IN YOUR DATA SCIENCE ROLE 13. When your data science project fails 14. Joining the data science community 15. Leaving your job gracefully 16. Moving up the ladder |
data analytics case studies: Data Preparation for Analytics Using SAS Gerhard Svolba, 2006-11-27 Written for anyone involved in the data preparation process for analytics, Gerhard Svolba's Data Preparation for Analytics Using SAS offers practical advice in the form of SAS coding tips and tricks, and provides the reader with a conceptual background on data structures and considerations from a business point of view. The tasks addressed include viewing analytic data preparation in the context of its business environment, identifying the specifics of predictive modeling for data mart creation, understanding the concepts and considerations of data preparation for time series analysis, using various SAS procedures and SAS Enterprise Miner for scoring, creating meaningful derived variables for all data mart types, using powerful SAS macros to make changes among the various data mart structures, and more! |
data analytics case studies: High-Performance Big-Data Analytics Pethuru Raj, Anupama Raman, Dhivya Nagaraj, Siddhartha Duggirala, 2015-10-16 This book presents a detailed review of high-performance computing infrastructures for next-generation big data and fast data analytics. Features: includes case studies and learning activities throughout the book and self-study exercises in every chapter; presents detailed case studies on social media analytics for intelligent businesses and on big data analytics (BDA) in the healthcare sector; describes the network infrastructure requirements for effective transfer of big data, and the storage infrastructure requirements of applications which generate big data; examines real-time analytics solutions; introduces in-database processing and in-memory analytics techniques for data mining; discusses the use of mainframes for handling real-time big data and the latest types of data management systems for BDA; provides information on the use of cluster, grid and cloud computing systems for BDA; reviews the peer-to-peer techniques and tools and the common information visualization techniques, used in BDA. |
data analytics case studies: Anonymizing Health Data Khaled El Emam, Luk Arbuckle, 2013-12-11 Updated as of August 2014, this practical book will demonstrate proven methods for anonymizing health data to help your organization share meaningful datasets, without exposing patient identity. Leading experts Khaled El Emam and Luk Arbuckle walk you through a risk-based methodology, using case studies from their efforts to de-identify hundreds of datasets. Clinical data is valuable for research and other types of analytics, but making it anonymous without compromising data quality is tricky. This book demonstrates techniques for handling different data types, based on the authors’ experiences with a maternal-child registry, inpatient discharge abstracts, health insurance claims, electronic medical record databases, and the World Trade Center disaster registry, among others. Understand different methods for working with cross-sectional and longitudinal datasets Assess the risk of adversaries who attempt to re-identify patients in anonymized datasets Reduce the size and complexity of massive datasets without losing key information or jeopardizing privacy Use methods to anonymize unstructured free-form text data Minimize the risks inherent in geospatial data, without omitting critical location-based health information Look at ways to anonymize coding information in health data Learn the challenge of anonymously linking related datasets |
data analytics case studies: Text Analytics for Business Decisions Andres Fortino, 2021-05-13 With the rise in data science development, we now have many remarkable techniques and tools to extend data analysis from numeric and categorical data to textual data. Sifting through the open-ended responses from a survey, for example, was an arduous process when performed by hand. Using a case study approach, this book was written for business analysts who wish to increase their skills in extracting answers for text data in order to support business decision making. Most of the exercises use Excel, today’s most common analysis tool, and R, a popular analytic computer environment. The techniques covered range from the most basic text analytics, such as key word analysis, to more sophisticated techniques, such as topic extraction and text similarity scoring. Companion files with numerous datasets are included for use with case studies and exercises. FEATURES: Organized by tool or technique, with the basic techniques presented first and the more sophisticated techniques presented later Uses Excel and R for datasets in case studies and exercises Features the CRISP-DM data mining standard with early chapters for conducting the preparatory steps in data mining Companion files with numerous datasets and figures from the text. The companion files are available online by emailing the publisher with proof of purchase at info@merclearning.com. |
data analytics case studies: The role of data for digital markets contestability Jan Krämer, Daniel Schnurr, Sally Broughton Micova, 2020-09-09 This report analyses the processes that turn data into economic value for online search, e-commerce and media platforms. It concludes that forcing data sharing through policy intervention would not prevent dominant incumbents to continue to benefit economically from greater access to data over new entrants. Instead, policy makers should focus on enabling niche entry, niche growth and a level playing field for competitors in new and emerging markets. Data play a central role in the business models that shape competition and innovation in digital markets. As dominant providers of online services collect ever more user data they generate data-driven network effects. They can then improve their services faster, and venture faster into related markets than competitors with less data, thereby raising entry barriers for innovative start-ups. The authors, Sally Broughton Micova (CERRE & University of East Anglia), Jan Krämer (CERRE & University of Passau) and Daniel Schnurr (University of Passau), have analysed processes that transform data into economic value for online search, e-commerce and media platforms. They find that in each case, more data, especially on user behaviour, gradually improves the quality of the service, thereby generating high economic benefits for the firm. The authors find that data-driven network effects can nevertheless be a source of efficiency which can ultimately benefit consumers. Even if some data is shared through policy intervention, dominant incumbents will continue to benefit economically and competitively from greater access to data over new entrants. “We conclude that it is neither realistic nor desirable to try to break data-driven network effects through policy intervention. Instead, we would strongly encourage policy makers to focus on enabling niche entry and niche growth. To do so, they should facilitate the sharing of behavioural user data gathered by the dominant firm with other firms.” The authors provide policy recommendations for data access remedies to safeguard competition, innovation and the openness of the digital ecosystem: 1. Remedies that achieve a more level playing field in the digital economy by breaking the data-driven network effects of data-rich incumbents should be entertained as a last resort and only under specific conditions. 2. Policy makers should foster data sharing on two levels to strike a balance between consumers’ privacy, competition and innovation. They should require the sharing of aggregated and anonymised raw user data in bulk, after a careful review and on a case-by-case basis. They should also facilitate the sharing of detailed raw user data through improved data portability, based on individual users’ consent. Bulk sharing of raw user data should be limited to data that was collected as a by-product of the incumbent’s dominant user-facing service, such as search logs, in order to maintain incentives for innovation and data collection. The main challenge will be to balance privacy concerns with maintaining enough detailed data to ensure it is of value to third-parties. 3. Dominant firms should also be obliged to allow consumers to port their raw data to another provider continuously and in real time. Privacy concerns can then be overcome and the shared user profiles can be more detailed than under bulk sharing. In concert with bulk-sharing, data portability can be a valuable source for attaining both detailed and representative data sets. |
data analytics case studies: Data Science Projects with Python Stephen Klosterman, 2019-04-30 Gain hands-on experience with industry-standard data analysis and machine learning tools in Python Key FeaturesTackle data science problems by identifying the problem to be solvedIllustrate patterns in data using appropriate visualizationsImplement suitable machine learning algorithms to gain insights from dataBook Description Data Science Projects with Python is designed to give you practical guidance on industry-standard data analysis and machine learning tools, by applying them to realistic data problems. You will learn how to use pandas and Matplotlib to critically examine datasets with summary statistics and graphs, and extract the insights you seek to derive. You will build your knowledge as you prepare data using the scikit-learn package and feed it to machine learning algorithms such as regularized logistic regression and random forest. You’ll discover how to tune algorithms to provide the most accurate predictions on new and unseen data. As you progress, you’ll gain insights into the working and output of these algorithms, building your understanding of both the predictive capabilities of the models and why they make these predictions. By then end of this book, you will have the necessary skills to confidently use machine learning algorithms to perform detailed data analysis and extract meaningful insights from unstructured data. What you will learnInstall the required packages to set up a data science coding environmentLoad data into a Jupyter notebook running PythonUse Matplotlib to create data visualizationsFit machine learning models using scikit-learnUse lasso and ridge regression to regularize your modelsCompare performance between models to find the best outcomesUse k-fold cross-validation to select model hyperparametersWho this book is for If you are a data analyst, data scientist, or business analyst who wants to get started using Python and machine learning techniques to analyze data and predict outcomes, this book is for you. Basic knowledge of Python and data analytics will help you get the most from this book. Familiarity with mathematical concepts such as algebra and basic statistics will also be useful. |
data analytics case studies: Data Analytics for IT Networks John Garrett, 2018-10-24 Use data analytics to drive innovation and value throughout your network infrastructure Network and IT professionals capture immense amounts of data from their networks. Buried in this data are multiple opportunities to solve and avoid problems, strengthen security, and improve network performance. To achieve these goals, IT networking experts need a solid understanding of data science, and data scientists need a firm grasp of modern networking concepts. Data Analytics for IT Networks fills these knowledge gaps, allowing both groups to drive unprecedented value from telemetry, event analytics, network infrastructure metadata, and other network data sources. Drawing on his pioneering experience applying data science to large-scale Cisco networks, John Garrett introduces the specific data science methodologies and algorithms network and IT professionals need, and helps data scientists understand contemporary network technologies, applications, and data sources. After establishing this shared understanding, Garrett shows how to uncover innovative use cases that integrate data science algorithms with network data. He concludes with several hands-on, Python-based case studies reflecting Cisco Customer Experience (CX) engineers’ supporting its largest customers. These are designed to serve as templates for developing custom solutions ranging from advanced troubleshooting to service assurance. Understand the data analytics landscape and its opportunities in Networking See how elements of an analytics solution come together in the practical use cases Explore and access network data sources, and choose the right data for your problem Innovate more successfully by understanding mental models and cognitive biases Walk through common analytics use cases from many industries, and adapt them to your environment Uncover new data science use cases for optimizing large networks Master proven algorithms, models, and methodologies for solving network problems Adapt use cases built with traditional statistical methods Use data science to improve network infrastructure analysisAnalyze control and data planes with greater sophistication Fully leverage your existing Cisco tools to collect, analyze, and visualize data |
data analytics case studies: Data Analytics for Organisational Development Uwe H. Kaufmann, Amy B. C. Tan, 2021-07-26 A practical guide for anyone who aspires to become data analytics–savvy Data analytics has become central to the operation of most businesses, making it an increasingly necessary skill for every manager and for all functions across an organisation. Data Analytics for Organisational Development: Unleashing the Potential of Your Data introduces a methodical process for gathering, screening, transforming, and analysing the correct datasets to ensure that they are reliable tools for business decision-making. Written by a Six Sigma Master Black Belt and a Lean Six Sigma Black Belt, this accessible guide explains and illustrates the application of data analytics for organizational development and design, with particular focus on Customer and Strategy Analytics, Operations Analytics and Workforce Analytics. Designed as both a handbook and workbook, Data Analytics for Organisational Development presents the application of data analytics for organizational design and development using case studies and practical examples. It aims to help build a bridge between data scientists, who have less exposure to actual business issues, and the non-data scientists. With this guide, anyone can learn to perform data analytics tasks from translating a business question into a data science hypothesis to understanding the data science results and making the appropriate decisions. From data acquisition, cleaning, and transformation to analysis and decision making, this book covers it all. It also helps you avoid the pitfalls of unsound decision making, no matter where in the value chain you work. Follow the “Five Steps of a Data Analytics Case” to arrive at the correct business decision based on sound data analysis Become more proficient in effectively communicating and working with the data experts, even if you have no background in data science Learn from cases and practical examples that demonstrate a systematic method for gathering and processing data accurately Work through end-of-chapter exercises to review key concepts and apply methods using sample data sets Data Analytics for Organisational Development includes downloadable tools for learning enrichment, including spreadsheets, Power BI slides, datasets, R analysis steps and more. Regardless of your level in your organisation, this book will help you become savvy with data analytics, one of today’s top business tools. |
data analytics case studies: Big Data on Campus Karen L. Webber, Henry Y. Zheng, 2020-11-03 Webber, Henry Y. Zheng, Ying Zhou |
data analytics case studies: Cracking the Data Science Interview Maverick Lin, 2019-12-17 Cracking the Data Science Interview is the first book that attempts to capture the essence of data science in a concise, compact, and clean manner. In a Cracking the Coding Interview style, Cracking the Data Science Interview first introduces the relevant concepts, then presents a series of interview questions to help you solidify your understanding and prepare you for your next interview. Topics include: - Necessary Prerequisites (statistics, probability, linear algebra, and computer science) - 18 Big Ideas in Data Science (such as Occam's Razor, Overfitting, Bias/Variance Tradeoff, Cloud Computing, and Curse of Dimensionality) - Data Wrangling (exploratory data analysis, feature engineering, data cleaning and visualization) - Machine Learning Models (such as k-NN, random forests, boosting, neural networks, k-means clustering, PCA, and more) - Reinforcement Learning (Q-Learning and Deep Q-Learning) - Non-Machine Learning Tools (graph theory, ARIMA, linear programming) - Case Studies (a look at what data science means at companies like Amazon and Uber) Maverick holds a bachelor's degree from the College of Engineering at Cornell University in operations research and information engineering (ORIE) and a minor in computer science. He is the author of the popular Data Science Cheatsheet and Data Engineering Cheatsheet on GCP and has previous experience in data science consulting for a Fortune 500 company focusing on fraud analytics. |
data analytics case studies: Industry 4.0 Interoperability, Analytics, Security, and Case Studies G. Rajesh, X. Mercilin Raajini, Hien Dang, 2021-01-30 All over the world, vast research is in progress on the domain of Industry 4.0 and related techniques. Industry 4.0 is expected to have a very high impact on labor markets, global value chains, education, health, environment, and many social economic aspects. Industry 4.0 Interoperability, Analytics, Security, and Case Studies provides a deeper understanding of the drivers and enablers of Industry 4.0. It includes real case studies of various applications related to different fields, such as cyber physical systems (CPS), Internet of Things (IoT), cloud computing, machine learning, virtualization, decentralization, blockchain, fog computing, and many other related areas. Also discussed are interoperability, design, and implementation challenges. Researchers, academicians, and those working in industry around the globe will find this book of interest. FEATURES Provides an understanding of the drivers and enablers of Industry 4.0 Includes real case studies of various applications for different fields Discusses technologies such as cyber physical systems (CPS), Internet of Things (IoT), cloud computing, machine learning, virtualization, decentralization, blockchain, fog computing, and many other related areas Covers design, implementation challenges, and interoperability Offers detailed knowledge on Industry 4.0 and its underlying technologies, research challenges, solutions, and case studies |
data analytics case studies: Big Data and Analytics Vincenzo Morabito, 2015-01-31 This book presents and discusses the main strategic and organizational challenges posed by Big Data and analytics in a manner relevant to both practitioners and scholars. The first part of the book analyzes strategic issues relating to the growing relevance of Big Data and analytics for competitive advantage, which is also attributable to empowerment of activities such as consumer profiling, market segmentation, and development of new products or services. Detailed consideration is also given to the strategic impact of Big Data and analytics on innovation in domains such as government and education and to Big Data-driven business models. The second part of the book addresses the impact of Big Data and analytics on management and organizations, focusing on challenges for governance, evaluation, and change management, while the concluding part reviews real examples of Big Data and analytics innovation at the global level. The text is supported by informative illustrations and case studies, so that practitioners can use the book as a toolbox to improve understanding and exploit business opportunities related to Big Data and analytics. |
data analytics case studies: Machine Learning and Data Science in the Power Generation Industry Patrick Bangert, 2021-01-14 Machine Learning and Data Science in the Power Generation Industry explores current best practices and quantifies the value-add in developing data-oriented computational programs in the power industry, with a particular focus on thoughtfully chosen real-world case studies. It provides a set of realistic pathways for organizations seeking to develop machine learning methods, with a discussion on data selection and curation as well as organizational implementation in terms of staffing and continuing operationalization. It articulates a body of case study–driven best practices, including renewable energy sources, the smart grid, and the finances around spot markets, and forecasting. - Provides best practices on how to design and set up ML projects in power systems, including all nontechnological aspects necessary to be successful - Explores implementation pathways, explaining key ML algorithms and approaches as well as the choices that must be made, how to make them, what outcomes may be expected, and how the data must be prepared for them - Determines the specific data needs for the collection, processing, and operationalization of data within machine learning algorithms for power systems - Accompanied by numerous supporting real-world case studies, providing practical evidence of both best practices and potential pitfalls |
data analytics case studies: Data Analytics and AI Jay Liebowitz, 2020-08-06 Analytics and artificial intelligence (AI), what are they good for? The bandwagon keeps answering, absolutely everything! Analytics and artificial intelligence have captured the attention of everyone from top executives to the person in the street. While these disciplines have a relatively long history, within the last ten or so years they have exploded into corporate business and public consciousness. Organizations have rushed to embrace data-driven decision making. Companies everywhere are turning out products boasting that artificial intelligence is included. We are indeed living in exciting times. The question we need to ask is, do we really know how to get business value from these exciting tools? Unfortunately, both the analytics and AI communities have not done a great job in collaborating and communicating with each other to build the necessary synergies. This book bridges the gap between these two critical fields. The book begins by explaining the commonalities and differences in the fields of data science, artificial intelligence, and autonomy by giving a historical perspective for each of these fields, followed by exploration of common technologies and current trends in each field. The book also readers introduces to applications of deep learning in industry with an overview of deep learning and its key architectures, as well as a survey and discussion of the main applications of deep learning. The book also presents case studies to illustrate applications of AI and analytics. These include a case study from the healthcare industry and an investigation of a digital transformation enabled by AI and analytics transforming a product-oriented company into one delivering solutions and services. The book concludes with a proposed AI-informed data analytics life cycle to be applied to unstructured data. |
data analytics case studies: Pro Hadoop Data Analytics Kerry Koitzsch, 2016-12-29 Learn advanced analytical techniques and leverage existing tool kits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems that go beyond the basics of classification, clustering, and recommendation. Pro Hadoop Data Analytics emphasizes best practices to ensure coherent, efficient development. A complete example system will be developed using standard third-party components that consist of the tool kits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system. The book also highlights the importance of end-to-end, flexible, configurable, high-performance data pipeline systems with analytical components as well as appropriate visualization results. You'll discover the importance of mix-and-match or hybrid systems, using different analytical components in one application. This hybrid approach will be prominent in the examples. What You'll Learn Build big data analytic systems with the Hadoop ecosystem Use libraries, tool kits, and algorithms to make development easier and more effective Apply metrics to measure performance and efficiency of components and systems Connect to standard relational databases, noSQL data sources, and more Follow case studies with example components to create your own systems Who This Book Is For Software engineers, architects, and data scientists with an interest in the design and implementation of big data analytical systems using Hadoop, the Hadoop ecosystem, and other associated technologies. |
data analytics case studies: The Big Data Revolution Jason Kolb, Jeremy Kolb, 2013 We create more data in a day then we did from the dawn of man through 2003 and approximately 90% of all the world's data has been created in the past 2 years. What does this mean to you? In The Big Data Revolution we explore this very question and reveal the data secrets your competitors don't want you to know. Our world is transforming as the data deluge knocks us out of our old ways and into the data driven reality. Some companies are winning by taking advantages of the opportunities in this evolving world while others are falling behind. Pioneers like Amazon, Target, and Google are blazing a trail that we can follow, and in The Big Data Revolution we help you do just that. Big Data promises to give us a world driven by information and solid data, bringing far greater productivity, increased profits, and lower costs; and in The Big Data Revolution we explore those winning strategies and techniques and the tools behind them. Want to learn how companies like Amazon, Target, and IBM use data to gain competitive advantages? Or how Obama used Big Data tools to better utilize his resources? The Big Data Revolution was written for the non-or-only-slightly-technical business person in mind--but in a way that gives you enough meat behind the ideas so that you have a road map that tells you how to get where you want to go. It uses real-world examples and case studies to illustrates the concepts and explore the technology that makes them happen. The Big Data Revolution is comprised of four parts: Part 1: Data Science In Part 1 we first introduce you to the world of data science and analytics. These are the tools companies and governments use to refine their crude data into valuable insights. In this section, we'll look at the magic behind Amazon's success, and see how data is leading towards a near Minority Report future. Part 2: Big Data Data is growing at an exceptional rate, we produce more data now in a day than we did from the dawn of man till 2003. This explosion of data creates many unique struggles as well as opportunities. In this section we'll look at how Obama invested in Big Data during his presidential campaign, and explore how startups are revealing data that saves their clients substantial capital. Part 3: Tools of the trade Data Scientists cannot just look at big data and get value from it, it doesn't matter how good they are. The data is just too big. So companies like IBM and Microsoft build tools that help people make sense of data, and hopefully discover new useful insights from it. The two primary categories of tools you need to be aware of are Business Intelligence and Data Discovery. In this section we explore these broad terms, and show how companies are designing more specialized tools for specific purposes. Part 4: Gazing into the Future In order to position yourself well for what is to come you need to know where we are now and almost more importantly where we are going to be in the near future. In this section we explore the trends that are going to matter as we move forward in this emerging technology industry. Computerized Data Analytics is truly still in its early stages of development, and things are going to change as new innovations come to the forefront. If we are serious about gaining the data advantage, we need to stay ahead of this curve. The Big Data Revolution is your tool to understanding this complex new reality of your world. Get it today and don't miss out on the data driven future. The world is changing. Are you ready? |
data analytics case studies: Solving Modern Crime in Financial Markets Marius-Cristian Frunza, 2015-12-09 This comprehensive source of information about financial fraud delivers a mature approach to fraud detection and prevention. It brings together all important aspect of analytics used in investigating modern crime in financial markets and uses R for its statistical examples. It focuses on crime in financial markets as opposed to the financial industry, and it highlights technical aspects of crime detection and prevention as opposed to their qualitative aspects. For those with strong analytic skills, this book unleashes the usefulness of powerful predictive and prescriptive analytics in predicting and preventing modern crime in financial markets. - Interviews and case studies provide context and depth to examples - Case studies use R, the powerful statistical freeware tool - Useful in classroom and professional contexts |
data analytics case studies: Data Science and Predictive Analytics Ivo D. Dinov, 2023-02-16 This textbook integrates important mathematical foundations, efficient computational algorithms, applied statistical inference techniques, and cutting-edge machine learning approaches to address a wide range of crucial biomedical informatics, health analytics applications, and decision science challenges. Each concept in the book includes a rigorous symbolic formulation coupled with computational algorithms and complete end-to-end pipeline protocols implemented as functional R electronic markdown notebooks. These workflows support active learning and demonstrate comprehensive data manipulations, interactive visualizations, and sophisticated analytics. The content includes open problems, state-of-the-art scientific knowledge, ethical integration of heterogeneous scientific tools, and procedures for systematic validation and dissemination of reproducible research findings. Complementary to the enormous challenges related to handling, interrogating, and understanding massive amounts of complex structured and unstructured data, there are unique opportunities that come with access to a wealth of feature-rich, high-dimensional, and time-varying information. The topics covered in Data Science and Predictive Analytics address specific knowledge gaps, resolve educational barriers, and mitigate workforce information-readiness and data science deficiencies. Specifically, it provides a transdisciplinary curriculum integrating core mathematical principles, modern computational methods, advanced data science techniques, model-based machine learning, model-free artificial intelligence, and innovative biomedical applications. The book’s fourteen chapters start with an introduction and progressively build foundational skills from visualization to linear modeling, dimensionality reduction, supervised classification, black-box machine learning techniques, qualitative learning methods, unsupervised clustering, model performance assessment, feature selection strategies, longitudinal data analytics, optimization, neural networks, and deep learning. The second edition of the book includes additional learning-based strategies utilizing generative adversarial networks, transfer learning, and synthetic data generation, as well as eight complementary electronic appendices. This textbook is suitable for formal didactic instructor-guided course education, as well as for individual or team-supported self-learning. The material is presented at the upper-division and graduate-level college courses and covers applied and interdisciplinary mathematics, contemporary learning-based data science techniques, computational algorithm development, optimization theory, statistical computing, and biomedical sciences. The analytical techniques and predictive scientific methods described in the book may be useful to a wide range of readers, formal and informal learners, college instructors, researchers, and engineers throughout the academy, industry, government, regulatory, funding, and policy agencies. The supporting book website provides many examples, datasets, functional scripts, complete electronic notebooks, extensive appendices, and additional materials. |
data analytics case studies: Quantifying the Qualitative Katya Drozdova, Kurt Taylor Gaubatz, 2015-12-30 Quantifying the Qualitative by Katya Drozdova and Kurt Taylor Gaubatz presents a systematic approach to comparative case analysis based on insights from information theory. This new method, which requires minimal quantitative skills, helps students, policymakers, professionals, and scholars learn more from comparative cases. The approach avoids the limitations of traditional statistics in the small-n context and allows analysts to systematically assess and compare the impact of a set of factors on case outcomes with easy-to-use analytics. Rigorous tools reduce bias, improve the knowledge gained from case studies, and provide straightforward metrics for effectively communicating results to a range of readers and leaders. |
data analytics case studies: Winning with Data Tomasz Tunguz, Frank Bien, 2016-06-20 Crest the data wave with a deep cultural shift Winning with Data explores the cultural changes big data brings to business, and shows you how to adapt your organization to leverage data to maximum effect. Authors Tomasz Tunguz and Frank Bien draw on extensive background in big data, business intelligence, and business strategy to provide a blueprint for companies looking to move head-on into the data wave. Instrumentation is discussed in detail, but the core of the change is in the culture—this book provides sound guidance on building the type of organizational culture that creates and leverages data daily, in every aspect of the business. Real-world examples illustrate these important concepts at work: you'll learn how data helped Warby-Parker disrupt a $13 billion monopolized market, how ThredUp uses data to process more than 20 thousand items of clothing every day, how Venmo leverages data to build better products, how HubSpot empowers their salespeople to be more productive, and more. From decision making and strategy to shipping and sales, this book shows you how data makes better business. Big data has taken on buzzword status, but there is little real guidance for companies seeking everyday business data solutions. This book takes a deeper look at big data in business, and shows you how to shift internal culture ahead of the curve. Understand the changes a data culture brings to companies Instrument your company for maximum benefit Utilize data to optimize every aspect of your business Improve decision making and transform business strategy Big data is becoming the number-one topic in business, yet no one is asking the right questions. Leveraging the full power of data requires more than good IT—organization-wide buy-in is essential for long-term success. Winning with Data is the expert guide to making data work for your business, and your needs. |
data analytics case studies: Analytics Across the Enterprise Brenda L. Dietrich, Emily C. Plachy, Maureen F. Norton, 2014-05-15 How to Transform Your Organization with Analytics: Insider Lessons from IBM’s Pioneering Experience Analytics is not just a technology: It is a better way to do business. Using analytics, you can systematically inform human judgment with data-driven insight. This doesn’t just improve decision-making: It also enables greater innovation and creativity in support of strategy. Your transformation won’t happen overnight; however, it is absolutely achievable, and the rewards are immense. This book demystifies your analytics journey by showing you how IBM has successfully leveraged analytics across the enterprise, worldwide. Three of IBM’s pioneering analytics practitioners share invaluable real-world perspectives on what does and doesn’t work and how you can start or accelerate your own transformation. This book provides an essential framework for becoming a smarter enterprise and shows through 31 case studies how IBM has derived value from analytics throughout its business. Coverage Includes Creating a smarter workforce through big data and analytics More effectively optimizing supply chain processes Systematically improving financial forecasting Managing financial risk, increasing operational efficiency, and creating business value Reaching more B2B or B2C customers and deepening their engagement Optimizing manufacturing and product management processes Deploying your sales organization to increase revenue and effectiveness Achieving new levels of excellence in services delivery and reducing risk Transforming IT to enable wider use of analytics “Measuring the immeasurable” and filling gaps in imperfect data Whatever your industry or role, whether a current or future leader, analytics can make you smarter and more competitive. Analytics Across the Enterprise shows how IBM did it--and how you can, too. Learn more about IBM Analytics |
data analytics case studies: Contemporary Research Methods and Data Analytics in the News Industry Gibbs, William J., 2015-07-01 The advent of digital technologies has changed the news and publishing industries drastically. While shrinking newsrooms may be a concern for many, journalists and publishing professionals are working to reorient their skills and capabilities to employ technology for the purpose of better understanding and engaging with their audiences. Contemporary Research Methods and Data Analytics in the News Industry highlights the research behind the innovations and emerging practices being implemented within the journalism industry. This crucial, industry-shattering publication focuses on key topics in social media and video streaming as a new form of media communication as well the application of big data and data analytics for collecting information and drawing conclusions about the current and future state of print and digital news. Due to significant insight surrounding the latest applications and technologies affecting the news industry, this publication is a must-have resource for journalists, analysts, news media professionals, social media strategists, researchers, television news producers, and upper-level students in journalism and media studies. This timely industry resource includes key topics on the changing scope of the news and publishing industries including, but not limited to, big data, broadcast journalism, computational journalism, computer-mediated communication, data scraping, digital media, news media, social media, text mining, and user experience. |
data analytics case studies: Adoption of Data Analytics in Higher Education Learning and Teaching Dirk Ifenthaler, David Gibson, 2020-08-10 The book aims to advance global knowledge and practice in applying data science to transform higher education learning and teaching to improve personalization, access and effectiveness of education for all. Currently, higher education institutions and involved stakeholders can derive multiple benefits from educational data mining and learning analytics by using different data analytics strategies to produce summative, real-time, and predictive or prescriptive insights and recommendations. Educational data mining refers to the process of extracting useful information out of a large collection of complex educational datasets while learning analytics emphasizes insights and responses to real-time learning processes based on educational information from digital learning environments, administrative systems, and social platforms. This volume provides insight into the emerging paradigms, frameworks, methods and processes of managing change to better facilitate organizational transformation toward implementation of educational data mining and learning analytics. It features current research exploring the (a) theoretical foundation and empirical evidence of the adoption of learning analytics, (b) technological infrastructure and staff capabilities required, as well as (c) case studies that describe current practices and experiences in the use of data analytics in higher education. |
data analytics case studies: Applying Predictive Analytics Richard V. McCarthy, Mary M. McCarthy, Wendy Ceccucci, Leila Halawi, 2019-03-12 This textbook presents a practical approach to predictive analytics for classroom learning. It focuses on using analytics to solve business problems and compares several different modeling techniques, all explained from examples using the SAS Enterprise Miner software. The authors demystify complex algorithms to show how they can be utilized and explained within the context of enhancing business opportunities. Each chapter includes an opening vignette that provides real-life example of how business analytics have been used in various aspects of organizations to solve issue or improve their results. A running case provides an example of a how to build and analyze a complex analytics model and utilize it to predict future outcomes. |
Data and Digital Outputs Management Plan (DDOMP)
Data and Digital Outputs Management Plan (DDOMP)
Building New Tools for Data Sharing and Reuse through a …
Jan 10, 2019 · The SEI CRA will closely link research thinking and technological innovation toward accelerating the full path of discovery-driven data use and open science. This will enable a …
Open Data Policy and Principles - Belmont Forum
The data policy includes the following principles: Data should be: Discoverable through catalogues and search engines; Accessible as open data by default, and made available with …
Belmont Forum Adopts Open Data Principles for Environmental …
Jan 27, 2016 · Adoption of the open data policy and principles is one of five recommendations in A Place to Stand: e-Infrastructures and Data Management for Global Change Research, …
Belmont Forum Data Accessibility Statement and Policy
The DAS encourages researchers to plan for the longevity, reusability, and stability of the data attached to their research publications and results. Access to data promotes reproducibility, …
Climate-Induced Migration in Africa and Beyond: Big Data and …
CLIMB will also leverage earth observation and social media data, and combine them with survey and official statistical data. This holistic approach will allow us to analyze migration process …
Advancing Resilience in Low Income Housing Using Climate …
Jun 4, 2020 · Environmental sustainability and public health considerations will be included. Machine Learning and Big Data Analytics will be used to identify optimal disaster resilient …
Belmont Forum
What is the Belmont Forum? The Belmont Forum is an international partnership that mobilizes funding of environmental change research and accelerates its delivery to remove critical …
Waterproofing Data: Engaging Stakeholders in Sustainable Flood …
Apr 26, 2018 · Waterproofing Data investigates the governance of water-related risks, with a focus on social and cultural aspects of data practices. Typically, data flows up from local levels to …
Data Management Annex (Version 1.4) - Belmont Forum
A full Data Management Plan (DMP) for an awarded Belmont Forum CRA project is a living, actively updated document that describes the data management life cycle for the data to be …
Data and Digital Outputs Management Plan (DDOMP)
Data and Digital Outputs Management Plan (DDOMP)
Building New Tools for Data Sharing and Reuse through a Transnationa…
Jan 10, 2019 · The SEI CRA will closely link research thinking and technological innovation toward accelerating the full …
Open Data Policy and Principles - Belmont Forum
The data policy includes the following principles: Data should be: Discoverable through catalogues and search engines; …
Belmont Forum Adopts Open Data Principles for Environmental Chan…
Jan 27, 2016 · Adoption of the open data policy and principles is one of five recommendations in A Place to Stand: e …
Belmont Forum Data Accessibility Statement and Policy
The DAS encourages researchers to plan for the longevity, reusability, and stability of the data attached to their research …