Data Science For Social Good

Advertisement



  data science for social good: Data Science for Social Good Massimo Lapucci, Ciro Cattuto, 2021-10-13 This book is a collection of reflections by thought leaders at first-mover organizations in the exploding field of Data Science for Social Good, meant as the application of knowledge from computer science, complex systems and computational social science to challenges such as humanitarian response, public health, sustainable development. The book provides both an overview of scientific approaches to social impact – identifying a social need, targeting an intervention, measuring impact – and the complementary perspective of funders and philanthropies that are pushing forward this new sector. This book will appeal to students and researchers in the rapidly growing field of data science for social impact, to data scientists at companies whose data could be used to generate more public value, and to decision makers at nonprofits, foundations, and agencies that are designing their own agenda around data.
  data science for social good: Big Data and Social Science Ian Foster, Rayid Ghani, Ron S. Jarmin, Frauke Kreuter, Julia Lane, 2020-11-17 Big Data and Social Science: Data Science Methods and Tools for Research and Practice, Second Edition shows how to apply data science to real-world problems, covering all stages of a data-intensive social science or policy project. Prominent leaders in the social sciences, statistics, and computer science as well as the field of data science provide a unique perspective on how to apply modern social science research principles and current analytical and computational tools. The text teaches you how to identify and collect appropriate data, apply data science methods and tools to the data, and recognize and respond to data errors, biases, and limitations. Features: Takes an accessible, hands-on approach to handling new types of data in the social sciences Presents the key data science tools in a non-intimidating way to both social and data scientists while keeping the focus on research questions and purposes Illustrates social science and data science principles through real-world problems Links computer science concepts to practical social science research Promotes good scientific practice Provides freely available workbooks with data, code, and practical programming exercises, through Binder and GitHub New to the Second Edition: Increased use of examples from different areas of social sciences New chapter on dealing with Bias and Fairness in Machine Learning models Expanded chapters focusing on Machine Learning and Text Analysis Revamped hands-on Jupyter notebooks to reinforce concepts covered in each chapter This classroom-tested book fills a major gap in graduate- and professional-level data science and social science education. It can be used to train a new generation of social data scientists to tackle real-world problems and improve the skills and competencies of applied social scientists and public policy practitioners. It empowers you to use the massive and rapidly growing amounts of available data to interpret economic and social activities in a scientific and rigorous manner.
  data science for social good: Big Data and Social Science Ian Foster, Rayid Ghani, Ron S. Jarmin, Frauke Kreuter, Julia Lane, 2016-08-10 Both Traditional Students and Working Professionals Acquire the Skills to Analyze Social Problems. Big Data and Social Science: A Practical Guide to Methods and Tools shows how to apply data science to real-world problems in both research and the practice. The book provides practical guidance on combining methods and tools from computer science, statistics, and social science. This concrete approach is illustrated throughout using an important national problem, the quantitative study of innovation. The text draws on the expertise of prominent leaders in statistics, the social sciences, data science, and computer science to teach students how to use modern social science research principles as well as the best analytical and computational tools. It uses a real-world challenge to introduce how these tools are used to identify and capture appropriate data, apply data science models and tools to that data, and recognize and respond to data errors and limitations. For more information, including sample chapters and news, please visit the author's website.
  data science for social good: Data Analysis for Social Science Elena Llaudet, Kosuke Imai, 2022-11-29 Data analysis has become a necessary skill across the social sciences, and recent advancements in computing power have made knowledge of programming an essential component. Yet most data science books are intimidating and overwhelming to a non-specialist audience, including most undergraduates. This book will be a shorter, more focused and accessible version of Kosuke Imai's Quantitative Social Science book, which was published by Princeton in 2018 and has been adopted widely in graduate level courses of the same title. This book uses the same innovative approach as Quantitative Social Science , using real data and 'R' to answer a wide range of social science questions. It assumes no prior knowledge of statistics or coding. It starts with straightforward, simple data analysis and culminates with multivariate linear regression models, focusing more on the intuition of how the math works rather than the math itself. The book makes extensive use of data visualizations, diagrams, pictures, cartoons, etc., to help students understand and recall complex concepts, provides an easy to follow, step-by-step template of how to conduct data analysis from beginning to end, and will be accompanied by supplemental materials in the appendix and online for both students and instructors--
  data science for social good: Social Sensing Dong Wang, Tarek Abdelzaher, Lance Kaplan, 2015-04-17 Increasingly, human beings are sensors engaging directly with the mobile Internet. Individuals can now share real-time experiences at an unprecedented scale. Social Sensing: Building Reliable Systems on Unreliable Data looks at recent advances in the emerging field of social sensing, emphasizing the key problem faced by application designers: how to extract reliable information from data collected from largely unknown and possibly unreliable sources. The book explains how a myriad of societal applications can be derived from this massive amount of data collected and shared by average individuals. The title offers theoretical foundations to support emerging data-driven cyber-physical applications and touches on key issues such as privacy. The authors present solutions based on recent research and novel ideas that leverage techniques from cyber-physical systems, sensor networks, machine learning, data mining, and information fusion. Offers a unique interdisciplinary perspective bridging social networks, big data, cyber-physical systems, and reliability Presents novel theoretical foundations for assured social sensing and modeling humans as sensors Includes case studies and application examples based on real data sets Supplemental material includes sample datasets and fact-finding software that implements the main algorithms described in the book
  data science for social good: Data Science for Social Good Massimo Lapucci, Ciro Cattuto, 2021 This book is a collection of reflections by thought leaders at first-mover organizations in the exploding field of Data Science for Social Good, meant as the application of knowledge from computer science, complex systems and computational social science to challenges such as humanitarian response, public health, sustainable development. The book provides both an overview of scientific approaches to social impact - identifying a social need, targeting an intervention, measuring impact - and the complementary perspective of funders and philanthropies that are pushing forward this new sector. This book will appeal to students and researchers in the rapidly growing field of data science for social impact, to data scientists at companies whose data could be used to generate more public value, and to decision makers at nonprofits, foundations, and agencies that are designing their own agenda around data.
  data science for social good: Data Feminism Catherine D'Ignazio, Lauren F. Klein, 2020-03-31 A new way of thinking about data science and data ethics that is informed by the ideas of intersectional feminism. Today, data science is a form of power. It has been used to expose injustice, improve health outcomes, and topple governments. But it has also been used to discriminate, police, and surveil. This potential for good, on the one hand, and harm, on the other, makes it essential to ask: Data science by whom? Data science for whom? Data science with whose interests in mind? The narratives around big data and data science are overwhelmingly white, male, and techno-heroic. In Data Feminism, Catherine D'Ignazio and Lauren Klein present a new way of thinking about data science and data ethics—one that is informed by intersectional feminist thought. Illustrating data feminism in action, D'Ignazio and Klein show how challenges to the male/female binary can help challenge other hierarchical (and empirically wrong) classification systems. They explain how, for example, an understanding of emotion can expand our ideas about effective data visualization, and how the concept of invisible labor can expose the significant human efforts required by our automated systems. And they show why the data never, ever “speak for themselves.” Data Feminism offers strategies for data scientists seeking to learn how feminism can help them work toward justice, and for feminists who want to focus their efforts on the growing field of data science. But Data Feminism is about much more than gender. It is about power, about who has it and who doesn't, and about how those differentials of power can be challenged and changed.
  data science for social good: Introduction to Data Science for Social and Policy Research Jose Manuel Magallanes Reyes, 2017-09-21 This comprehensive guide provides a step-by-step approach to data collection, cleaning, formatting, and storage, using Python and R.
  data science for social good: Data Science and Social Research II Paolo Mariani, Mariangela Zenga, 2020-11-25 The peer-reviewed contributions gathered in this book address methods, software and applications of statistics and data science in the social sciences. The data revolution in social science research has not only produced new business models, but has also provided policymakers with better decision-making support tools. In this volume, statisticians, computer scientists and experts on social research discuss the opportunities and challenges of the social data revolution in order to pave the way for addressing new research problems. The respective contributions focus on complex social systems and current methodological advances in extracting social knowledge from large data sets, as well as modern social research on human behavior and society using large data sets. Moreover, they analyze integrated systems designed to take advantage of new social data sources, and discuss quality-related issues. The papers were originally presented at the 2nd International Conference on Data Science and Social Research, held in Milan, Italy, on February 4-5, 2019.
  data science for social good: Data Science for Undergraduates National Academies of Sciences, Engineering, and Medicine, Division of Behavioral and Social Sciences and Education, Board on Science Education, Division on Engineering and Physical Sciences, Committee on Applied and Theoretical Statistics, Board on Mathematical Sciences and Analytics, Computer Science and Telecommunications Board, Committee on Envisioning the Data Science Discipline: The Undergraduate Perspective, 2018-11-11 Data science is emerging as a field that is revolutionizing science and industries alike. Work across nearly all domains is becoming more data driven, affecting both the jobs that are available and the skills that are required. As more data and ways of analyzing them become available, more aspects of the economy, society, and daily life will become dependent on data. It is imperative that educators, administrators, and students begin today to consider how to best prepare for and keep pace with this data-driven era of tomorrow. Undergraduate teaching, in particular, offers a critical link in offering more data science exposure to students and expanding the supply of data science talent. Data Science for Undergraduates: Opportunities and Options offers a vision for the emerging discipline of data science at the undergraduate level. This report outlines some considerations and approaches for academic institutions and others in the broader data science communities to help guide the ongoing transformation of this field.
  data science for social good: R for Data Science Hadley Wickham, Garrett Grolemund, 2016-12-12 Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You'll get a complete, big-picture understanding of the data science cycle, along with basic tools you need to manage the details. Each section of the book is paired with exercises to help you practice what you've learned along the way. You'll learn how to: Wrangle—transform your datasets into a form convenient for analysis Program—learn powerful R tools for solving data problems with greater clarity and ease Explore—examine your data, generate hypotheses, and quickly test them Model—provide a low-dimensional summary that captures true signals in your dataset Communicate—learn R Markdown for integrating prose, code, and results
  data science for social good: Data Activism and Social Change Miren Gutiérrez, 2018-05-02 This book efficiently contributes to our understanding of the interplay between data, technology and communicative practice on the one hand, and democratic participation on the other. It addresses the emergence of proactive data activism, a new sociotechnical phenomenon in the field of action that arises as a reaction to massive datafication, and makes affirmative use of data for advocacy and social change. By blending empirical observation and in-depth qualitative interviews, Gutiérrez brings to the fore a debate about the social uses of the data infrastructure and examines precisely how people employ it, in combination with other technologies, to collaborate and act for social change.
  data science for social good: Doing Data Science Cathy O'Neil, Rachel Schutt, 2013-10-09 Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that’s so clouded in hype? This insightful book, based on Columbia University’s Introduction to Data Science class, tells you what you need to know. In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you’re familiar with linear algebra, probability, and statistics, and have programming experience, this book is an ideal introduction to data science. Topics include: Statistical inference, exploratory data analysis, and the data science process Algorithms Spam filters, Naive Bayes, and data wrangling Logistic regression Financial modeling Recommendation engines and causality Data visualization Social networks and data journalism Data engineering, MapReduce, Pregel, and Hadoop Doing Data Science is collaboration between course instructor Rachel Schutt, Senior VP of Data Science at News Corp, and data science consultant Cathy O’Neil, a senior data scientist at Johnson Research Labs, who attended and blogged about the course.
  data science for social good: Social Physics Alex Pentland, 2014 A landmark tour of the new science of idea flow outlines revolutionary insights into the mysteries of collective intelligence and social influence, explaining the virtually unlimited data sets of today's digital technologies and the considerable accuracy of information from social networks.
  data science for social good: Oxford Bibliographies Edward J. Mullen, Offers peer-reviewed annotated bibliographies on social work as a discipline grounded in social theory and the improvement of peoples' lives. Bibliographies are browseable by subject area and keyword searchable. Contains a My OBO function that allows users to create personalized bibliographies of individual citations from different bibliographies.
  data science for social good: Workshop Proceedings of the 13th International AAAI Conference on Web and Social Media Emilio Zagheni, Stevie Chancellor, 2020-07-31
  data science for social good: Programming with Python for Social Scientists Phillip D. Brooker, 2019-12-09 As data become ′big′, fast and complex, the software and computing tools needed to manage and analyse them are rapidly developing. Social scientists need new tools to meet these challenges, tackle big datasets, while also developing a more nuanced understanding of - and control over - how these computing tools and algorithms are implemented. Programming with Python for Social Scientists offers a vital foundation to one of the most popular programming tools in computer science, specifically for social science researchers, assuming no prior coding knowledge. It guides you through the full research process, from question to publication, including: the fundamentals of why and how to do your own programming in social scientific research, questions of ethics and research design, a clear, easy to follow ′how-to′ guide to using Python, with a wide array of applications such as data visualisation, social media data research, social network analysis, and more. Accompanied by numerous code examples, screenshots, sample data sources, this is the textbook for social scientists looking for a complete introduction to programming with Python and incorporating it into their research design and analysis.
  data science for social good: A Hands-On Introduction to Data Science Chirag Shah, 2020-04-02 An introductory textbook offering a low barrier entry to data science; the hands-on approach will appeal to students from a range of disciplines.
  data science for social good: Decoding the Social World Sandra Gonzalez-Bailon, 2017-12-22 How data science and the analysis of networks help us solve the puzzle of unintended consequences. Social life is full of paradoxes. Our intentional actions often trigger outcomes that we did not intend or even envision. How do we explain those unintended effects and what can we do to regulate them? In Decoding the Social World, Sandra González-Bailón explains how data science and digital traces help us solve the puzzle of unintended consequences—offering the solution to a social paradox that has intrigued thinkers for centuries. Communication has always been the force that makes a collection of people more than the sum of individuals, but only now can we explain why: digital technologies have made it possible to parse the information we generate by being social in new, imaginative ways. And yet we must look at that data, González-Bailón argues, through the lens of theories that capture the nature of social life. The technologies we use, in the end, are also a manifestation of the social world we inhabit. González-Bailón discusses how the unpredictability of social life relates to communication networks, social influence, and the unintended effects that derive from individual decisions. She describes how communication generates social dynamics in aggregate (leading to episodes of “collective effervescence”) and discusses the mechanisms that underlie large-scale diffusion, when information and behavior spread “like wildfire.” She applies the theory of networks to illuminate why collective outcomes can differ drastically even when they arise from the same individual actions. By opening the black box of unintended effects, González-Bailón identifies strategies for social intervention and discusses the policy implications—and how data science and evidence-based research embolden critical thinking in a world that is constantly changing.
  data science for social good: Foundations of Data Science Avrim Blum, John Hopcroft, Ravindran Kannan, 2020-01-23 This book provides an introduction to the mathematical and algorithmic foundations of data science, including machine learning, high-dimensional geometry, and analysis of large networks. Topics include the counterintuitive nature of data in high dimensions, important linear algebraic techniques such as singular value decomposition, the theory of random walks and Markov chains, the fundamentals of and important algorithms for machine learning, algorithms and analysis for clustering, probabilistic models for large networks, representation learning including topic modelling and non-negative matrix factorization, wavelets and compressed sensing. Important probabilistic techniques are developed including the law of large numbers, tail inequalities, analysis of random projections, generalization guarantees in machine learning, and moment methods for analysis of phase transitions in large random graphs. Additionally, important structural and complexity measures are discussed such as matrix norms and VC-dimension. This book is suitable for both undergraduate and graduate courses in the design and analysis of algorithms for data.
  data science for social good: Human-Centered Data Science Cecilia Aragon, Shion Guha, Marina Kogan, Michael Muller, Gina Neff, 2022-03-01 Best practices for addressing the bias and inequality that may result from the automated collection, analysis, and distribution of large datasets. Human-centered data science is a new interdisciplinary field that draws from human-computer interaction, social science, statistics, and computational techniques. This book, written by founders of the field, introduces best practices for addressing the bias and inequality that may result from the automated collection, analysis, and distribution of very large datasets. It offers a brief and accessible overview of many common statistical and algorithmic data science techniques, explains human-centered approaches to data science problems, and presents practical guidelines and real-world case studies to help readers apply these methods. The authors explain how data scientists’ choices are involved at every stage of the data science workflow—and show how a human-centered approach can enhance each one, by making the process more transparent, asking questions, and considering the social context of the data. They describe how tools from social science might be incorporated into data science practices, discuss different types of collaboration, and consider data storytelling through visualization. The book shows that data science practitioners can build rigorous and ethical algorithms and design projects that use cutting-edge computational tools and address social concerns.
  data science for social good: Statistics for Empowerment and Social Engagement Jim Ridgway, 2023-03-10 “This book is a remarkable achievement” Gerd Gigerenzer This book offers practical approaches to working in a new field of knowledge - Civic Statistics - which sets out to engage with, and overcome well documented and long-standing problems in teaching quantitative skills. The book includes 23 peer-reviewed chapters, written in coordination by an international group of experts from ten countries. The book aims to support and enhance the work of teachers and lecturers working both at the high school and tertiary (university) levels. It is designed to promote and improve the critical understanding of quantitative evidence relevant to burning social issues – such as epidemics, climate change, poverty, migration, natural disasters, inequality, employment, and racism. Effective citizen engagement with social issues requires active participation and a broad understanding of data and statistics about societal issues. However, many statistics curricula are not designed to teach relevant skills nor to improve learners' statistical literacy. Evidence about social issues is provided to the public via print and digital media, official statistics offices, and other information channels, and a great deal of data is accessible both as aggregated summaries and as individual records. Chapters illustrate the approaches needed to teach and promote the knowledge, skills, dispositions, and enabling processes associated with critical understanding of Civic Statistics presented in many forms. These include: statistical analysis of authentic multivariate data; use of dynamic data visualisations; deconstructing texts about the social and economic well-being of societies and communities. Chapters discuss: the development of curricula and educational resources; use of emerging technologies and visualizations; preparation of teachers and teaching approaches; sources for relevant datasets and rich texts about Civic Statistics; ideas regarding future research, assessment, collaborations between different stakeholders; and other systemic issues.
  data science for social good: Applying Computational Intelligence for Social Good , 2024-01-14 Applying Computational Intelligence for Social Good: Track, Understand and Build a Better World, Volume 132 presents views on how Computational Intelligent and ICT technologies can be applied to ease or solve social problems by sharing examples of research results from studies of social anxiety, environmental issues, mobility of the disabled, and problems in social safety. Sample chapters in this release include Why is implementing Computational Intelligence for social good so challenging? Principles and its Application, Smart crisis management system for road accidents using Geo-Spacial Machine Learning Techniques, Residential Energy Management System (REMS) Using Machine Learning, Text-Based Personality Prediction using XLNet, and much more. - Explores a number of key themes, including self-organization, complex adaptive systems, and emergent computation for solving socially relevant problems - Focuses on Forecasting applications, Human Behavior and Critics response analysis in social forums, Healthcare monitoring Systems, Disaster Management, Industrial management, and most recently, Epidemics and Outbreaks - Brings together many different aspects of the current research on intelligence technologies, such as neural networks, support vector machines, fuzzy logic, and evolutionary computation
  data science for social good: Breaking the Social Media Prism Chris Bail, 2022-09-27 A revealing look at how user behavior is powering deep social divisions online—and how we might yet defeat political tribalism on social media In an era of increasing social isolation, platforms like Facebook and Twitter are among the most important tools we have to understand each other. We use social media as a mirror to decipher our place in society but, as Chris Bail explains, it functions more like a prism that distorts our identities, empowers status-seeking extremists, and renders moderates all but invisible. Breaking the Social Media Prism challenges common myths about echo chambers, foreign misinformation campaigns, and radicalizing algorithms, revealing that the solution to political tribalism lies deep inside ourselves. Drawing on innovative online experiments and in-depth interviews with social media users from across the political spectrum, this book explains why stepping outside of our echo chambers can make us more polarized, not less. Bail takes you inside the minds of online extremists through vivid narratives that trace their lives on the platforms and off—detailing how they dominate public discourse at the expense of the moderate majority. Wherever you stand on the spectrum of user behavior and political opinion, he offers fresh solutions to counter political tribalism from the bottom up and the top down. He introduces new apps and bots to help readers avoid misperceptions and engage in better conversations with the other side. Finally, he explores what the virtual public square might look like if we could hit reset and redesign social media from scratch through a first-of-its-kind experiment on a new social media platform built for scientific research. Providing data-driven recommendations for strengthening our social media connections, Breaking the Social Media Prism shows how to combat online polarization without deleting our accounts.
  data science for social good: Data Science and Social Research N. Carlo Lauro, Enrica Amaturo, Maria Gabriella Grassia, Biagio Aragona, Marina Marino, 2017-11-17 This edited volume lays the groundwork for Social Data Science, addressing epistemological issues, methods, technologies, software and applications of data science in the social sciences. It presents data science techniques for the collection, analysis and use of both online and offline new (big) data in social research and related applications. Among others, the individual contributions cover topics like social media, learning analytics, clustering, statistical literacy, recurrence analysis and network analysis. Data science is a multidisciplinary approach based mainly on the methods of statistics and computer science, and its aim is to develop appropriate methodologies for forecasting and decision-making in response to an increasingly complex reality often characterized by large amounts of data (big data) of various types (numeric, ordinal and nominal variables, symbolic data, texts, images, data streams, multi-way data, social networks etc.) and from diverse sources. This book presents selected papers from the international conference on Data Science & Social Research, held in Naples, Italy in February 2016, and will appeal to researchers in the social sciences working in academia as well as in statistical institutes and offices.
  data science for social good: Roundtable on Data Science Postsecondary Education National Academies of Sciences, Engineering, and Medicine, Division of Behavioral and Social Sciences and Education, Division on Engineering and Physical Sciences, Board on Science Education, Computer Science and Telecommunications Board, Committee on Applied and Theoretical Statistics, Board on Mathematical Sciences and Analytics, 2020-10-02 Established in December 2016, the National Academies of Sciences, Engineering, and Medicine's Roundtable on Data Science Postsecondary Education was charged with identifying the challenges of and highlighting best practices in postsecondary data science education. Convening quarterly for 3 years, representatives from academia, industry, and government gathered with other experts from across the nation to discuss various topics under this charge. The meetings centered on four central themes: foundations of data science; data science across the postsecondary curriculum; data science across society; and ethics and data science. This publication highlights the presentations and discussions of each meeting.
  data science for social good: Machine Learning and Principles and Practice of Knowledge Discovery in Databases Michael Kamp, Irena Koprinska, Adrien Bibal, Tassadit Bouadi, Benoît Frénay, Luis Galárraga, José Oramas, Linara Adilova, Yamuna Krishnamurthy, Bo Kang, Christine Largeron, Jefrey Lijffijt, Tiphaine Viard, Pascal Welke, Massimiliano Ruocco, Erlend Aune, Claudio Gallicchio, Gregor Schiele, Franz Pernkopf, Michaela Blott, Holger Fröning, Günther Schindler, Riccardo Guidotti, Anna Monreale, Salvatore Rinzivillo, Przemyslaw Biecek, Eirini Ntoutsi, Mykola Pechenizkiy, Bodo Rosenhahn, Christopher Buckley, Daniela Cialfi, Pablo Lanillos, Maxwell Ramstead, Tim Verbelen, Pedro M. Ferreira, Giuseppina Andresini, Donato Malerba, Ibéria Medeiros, Philippe Fournier-Viger, M. Saqib Nawaz, Sebastian Ventura, Meng Sun, Min Zhou, Valerio Bitetta, Ilaria Bordino, Andrea Ferretti, Francesco Gullo, Giovanni Ponti, Lorenzo Severini, Rita Ribeiro, João Gama, Ricard Gavaldà, Lee Cooper, Naghmeh Ghazaleh, Jonas Richiardi, Damian Roqueiro, Diego Saldana Miranda, Konstantinos Sechidis, Guilherme Graça, 2022-02-18 This two-volume set constitutes the refereed proceedings of the workshops which complemented the 21th Joint European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD, held in September 2021. Due to the COVID-19 pandemic the conference and workshops were held online. The 104 papers were thoroughly reviewed and selected from 180 papers submited for the workshops. This two-volume set includes the proceedings of the following workshops:Workshop on Advances in Interpretable Machine Learning and Artificial Intelligence (AIMLAI 2021)Workshop on Parallel, Distributed and Federated Learning (PDFL 2021)Workshop on Graph Embedding and Mining (GEM 2021)Workshop on Machine Learning for Irregular Time-series (ML4ITS 2021)Workshop on IoT, Edge, and Mobile for Embedded Machine Learning (ITEM 2021)Workshop on eXplainable Knowledge Discovery in Data Mining (XKDD 2021)Workshop on Bias and Fairness in AI (BIAS 2021)Workshop on Workshop on Active Inference (IWAI 2021)Workshop on Machine Learning for Cybersecurity (MLCS 2021)Workshop on Machine Learning in Software Engineering (MLiSE 2021)Workshop on MIning Data for financial applications (MIDAS 2021)Sixth Workshop on Data Science for Social Good (SoGood 2021)Workshop on Machine Learning for Pharma and Healthcare Applications (PharML 2021)Second Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning (EDML 2020)Workshop on Machine Learning for Buildings Energy Management (MLBEM 2021)
  data science for social good: Text as Data Justin Grimmer, Margaret E. Roberts, Brandon M. Stewart, 2022-03-29 A guide for using computational text analysis to learn about the social world From social media posts and text messages to digital government documents and archives, researchers are bombarded with a deluge of text reflecting the social world. This textual data gives unprecedented insights into fundamental questions in the social sciences, humanities, and industry. Meanwhile new machine learning tools are rapidly transforming the way science and business are conducted. Text as Data shows how to combine new sources of data, machine learning tools, and social science research design to develop and evaluate new insights. Text as Data is organized around the core tasks in research projects using text—representation, discovery, measurement, prediction, and causal inference. The authors offer a sequential, iterative, and inductive approach to research design. Each research task is presented complete with real-world applications, example methods, and a distinct style of task-focused research. Bridging many divides—computer science and social science, the qualitative and the quantitative, and industry and academia—Text as Data is an ideal resource for anyone wanting to analyze large collections of text in an era when data is abundant and computation is cheap, but the enduring challenges of social science remain. Overview of how to use text as data Research design for a world of data deluge Examples from across the social sciences and industry
  data science for social good: Data Analysis for Business, Economics, and Policy Gábor Békés, Gábor Kézdi, 2021-05-06 A comprehensive textbook on data analysis for business, applied economics and public policy that uses case studies with real-world data.
  data science for social good: Big Data and Social Science Ian Foster, Rayid Ghani, Ron S. Jarmin, Frauke Kreuter, Julia Lane, 2020-11-17 Big Data and Social Science: Data Science Methods and Tools for Research and Practice, Second Edition shows how to apply data science to real-world problems, covering all stages of a data-intensive social science or policy project. Prominent leaders in the social sciences, statistics, and computer science as well as the field of data science provide a unique perspective on how to apply modern social science research principles and current analytical and computational tools. The text teaches you how to identify and collect appropriate data, apply data science methods and tools to the data, and recognize and respond to data errors, biases, and limitations. Features: Takes an accessible, hands-on approach to handling new types of data in the social sciences Presents the key data science tools in a non-intimidating way to both social and data scientists while keeping the focus on research questions and purposes Illustrates social science and data science principles through real-world problems Links computer science concepts to practical social science research Promotes good scientific practice Provides freely available workbooks with data, code, and practical programming exercises, through Binder and GitHub New to the Second Edition: Increased use of examples from different areas of social sciences New chapter on dealing with Bias and Fairness in Machine Learning models Expanded chapters focusing on Machine Learning and Text Analysis Revamped hands-on Jupyter notebooks to reinforce concepts covered in each chapter This classroom-tested book fills a major gap in graduate- and professional-level data science and social science education. It can be used to train a new generation of social data scientists to tackle real-world problems and improve the skills and competencies of applied social scientists and public policy practitioners. It empowers you to use the massive and rapidly growing amounts of available data to interpret economic and social activities in a scientific and rigorous manner.
  data science for social good: Data Science for Undergraduates National Academies of Sciences, Engineering, and Medicine, Division of Behavioral and Social Sciences and Education, Board on Science Education, Division on Engineering and Physical Sciences, Committee on Applied and Theoretical Statistics, Board on Mathematical Sciences and Analytics, Computer Science and Telecommunications Board, Committee on Envisioning the Data Science Discipline: The Undergraduate Perspective, 2018-10-11 Data science is emerging as a field that is revolutionizing science and industries alike. Work across nearly all domains is becoming more data driven, affecting both the jobs that are available and the skills that are required. As more data and ways of analyzing them become available, more aspects of the economy, society, and daily life will become dependent on data. It is imperative that educators, administrators, and students begin today to consider how to best prepare for and keep pace with this data-driven era of tomorrow. Undergraduate teaching, in particular, offers a critical link in offering more data science exposure to students and expanding the supply of data science talent. Data Science for Undergraduates: Opportunities and Options offers a vision for the emerging discipline of data science at the undergraduate level. This report outlines some considerations and approaches for academic institutions and others in the broader data science communities to help guide the ongoing transformation of this field.
  data science for social good: Machine Learning and Knowledge Discovery in Databases Peggy Cellier, Kurt Driessens, 2020-03-27 This two-volume set constitutes the refereed proceedings of the workshops which complemented the 19th Joint European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD, held in Würzburg, Germany, in September 2019. The 70 full papers and 46 short papers presented in the two-volume set were carefully reviewed and selected from 200 submissions. The two volumes (CCIS 1167 and CCIS 1168) present the papers that have been accepted for the following workshops: Workshop on Automating Data Science, ADS 2019; Workshop on Advances in Interpretable Machine Learning and Artificial Intelligence and eXplainable Knowledge Discovery in Data Mining, AIMLAI-XKDD 2019; Workshop on Decentralized Machine Learning at the Edge, DMLE 2019; Workshop on Advances in Managing and Mining Large Evolving Graphs, LEG 2019; Workshop on Data and Machine Learning Advances with Multiple Views; Workshop on New Trends in Representation Learning with Knowledge Graphs; Workshop on Data Science for Social Good, SoGood 2019; Workshop on Knowledge Discovery and User Modelling for Smart Cities, UMCIT 2019; Workshop on Data Integration and Applications Workshop, DINA 2019; Workshop on Machine Learning for Cybersecurity, MLCS 2019; Workshop on Sports Analytics: Machine Learning and Data Mining for Sports Analytics, MLSA 2019; Workshop on Categorising Different Types of Online Harassment Languages in Social Media; Workshop on IoT Stream for Data Driven Predictive Maintenance, IoTStream 2019; Workshop on Machine Learning and Music, MML 2019; Workshop on Large-Scale Biomedical Semantic Indexing and Question Answering, BioASQ 2019. The chapter Supervised Human-guided Data Exploration is published open access under a Creative Commons Attribution 4.0 International license (CC BY).
  data science for social good: Machine Learning and Principles and Practice of Knowledge Discovery in Databases Irena Koprinska, Paolo Mignone, Riccardo Guidotti, Szymon Jaroszewicz, Holger Fröning, Francesco Gullo, Pedro M. Ferreira, Damian Roqueiro, Gaia Ceddia, Slawomir Nowaczyk, João Gama, Rita Ribeiro, Ricard Gavaldà, Elio Masciari, Zbigniew Ras, Ettore Ritacco, Francesca Naretto, Andreas Theissler, Przemyslaw Biecek, Wouter Verbeke, Gregor Schiele, Franz Pernkopf, Michaela Blott, Ilaria Bordino, Ivan Luciano Danesi, Giovanni Ponti, Lorenzo Severini, Annalisa Appice, Giuseppina Andresini, Ibéria Medeiros, Guilherme Graça, Lee Cooper, Naghmeh Ghazaleh, Jonas Richiardi, Diego Saldana, Konstantinos Sechidis, Arif Canakoglu, Sara Pido, Pietro Pinoli, Albert Bifet, Sepideh Pashami, 2023-01-30 This volume constitutes the papers of several workshops which were held in conjunction with the International Workshops of ECML PKDD 2022 on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2022, held in Grenoble, France, during September 19–23, 2022. The 73 revised full papers and 6 short papers presented in this book were carefully reviewed and selected from 143 submissions. ECML PKDD 2022 presents the following workshops: Workshop on Data Science for Social Good (SoGood 2022) Workshop on New Frontiers in Mining Complex Patterns (NFMCP 2022) Workshop on Explainable Knowledge Discovery in Data Mining (XKDD 2022) Workshop on Uplift Modeling (UMOD 2022) Workshop on IoT, Edge and Mobile for Embedded Machine Learning (ITEM 2022) Workshop on Mining Data for Financial Application (MIDAS 2022) Workshop on Machine Learning for Cybersecurity (MLCS 2022) Workshop on Machine Learning for Buildings Energy Management (MLBEM 2022) Workshop on Machine Learning for Pharma and Healthcare Applications (PharML 2022) Workshop on Data Analysis in Life Science (DALS 2022) Workshop on IoT Streams for Predictive Maintenance (IoT-PdM 2022)
  data science for social good: Social Network Data Analytics Charu C. Aggarwal, 2011-03-18 Social network analysis applications have experienced tremendous advances within the last few years due in part to increasing trends towards users interacting with each other on the internet. Social networks are organized as graphs, and the data on social networks takes on the form of massive streams, which are mined for a variety of purposes. Social Network Data Analytics covers an important niche in the social network analytics field. This edited volume, contributed by prominent researchers in this field, presents a wide selection of topics on social network data mining such as Structural Properties of Social Networks, Algorithms for Structural Discovery of Social Networks and Content Analysis in Social Networks. This book is also unique in focussing on the data analytical aspects of social networks in the internet scenario, rather than the traditional sociology-driven emphasis prevalent in the existing books, which do not focus on the unique data-intensive characteristics of online social networks. Emphasis is placed on simplifying the content so that students and practitioners benefit from this book. This book targets advanced level students and researchers concentrating on computer science as a secondary text or reference book. Data mining, database, information security, electronic commerce and machine learning professionals will find this book a valuable asset, as well as primary associations such as ACM, IEEE and Management Science.
  data science for social good: ECML PKDD 2020 Workshops Irena Koprinska, Michael Kamp, Annalisa Appice, Corrado Loglisci, Luiza Antonie, Albrecht Zimmermann, Riccardo Guidotti, Özlem Özgöbek, Rita P. Ribeiro, Ricard Gavaldà, João Gama, Linara Adilova, Yamuna Krishnamurthy, Pedro M. Ferreira, Donato Malerba, Ibéria Medeiros, Michelangelo Ceci, Giuseppe Manco, Elio Masciari, Zbigniew W. Ras, Peter Christen, Eirini Ntoutsi, Erich Schubert, Arthur Zimek, Anna Monreale, Przemyslaw Biecek, Salvatore Rinzivillo, Benjamin Kille, Andreas Lommatzsch, Jon Atle Gulla, 2021-02-01 This volume constitutes the refereed proceedings of the workshops which complemented the 20th Joint European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD, held in September 2020. Due to the COVID-19 pandemic the conference and workshops were held online. The 43 papers presented in volume were carefully reviewed and selected from numerous submissions. The volume presents the papers that have been accepted for the following workshops: 5th Workshop on Data Science for Social Good, SoGood 2020; Workshop on Parallel, Distributed and Federated Learning, PDFL 2020; Second Workshop on Machine Learning for Cybersecurity, MLCS 2020, 9th International Workshop on New Frontiers in Mining Complex Patterns, NFMCP 2020, Workshop on Data Integration and Applications, DINA 2020, Second Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning, EDML 2020, Second International Workshop on eXplainable Knowledge Discovery in Data Mining, XKDD 2020; 8th International Workshop on News Recommendation and Analytics, INRA 2020. The papers from INRA 2020 are published open access and licensed under the terms of the Creative Commons Attribution 4.0 International License.
  data science for social good: Quantitative Social Science Kosuke Imai, Lori D. Bougher, 2021-03-16 Princeton University Press published Imai's textbook, Quantitative Social Science: An Introduction, an introduction to quantitative methods and data science for upper level undergrads and graduates in professional programs, in February 2017. What is distinct about the book is how it leads students through a series of applied examples of statistical methods, drawing on real examples from social science research. The original book was prepared with the statistical software R, which is freely available online and has gained in popularity in recent years. But many existing courses in statistics and data sciences, particularly in some subject areas like sociology and law, use STATA, another general purpose package that has been the market leader since the 1980s. We've had several requests for STATA versions of the text as many programs use it by default. This is a translation of the original text, keeping all the current pedagogical text but inserting the necessary code and outputs from STATA in their place--
  data science for social good: Reinventing the Social Scientist and Humanist in the Era of Big Data Susan Brokensha, Eduan Kotzé, Burgert A. Senekal, 2019-12-01 This book explores the big data evolution by interrogating the notion that big data is a disruptive innovation that appears to be challenging existing epistemologies in the humanities and social sciences. Exploring various (controversial) facets of big data such as ethics, data power, and data justice, the book attempts to clarify the trajectory of the epistemology of (big) data-driven science in the humanities and social sciences.
  data science for social good: Data Visualization Kieran Healy, 2018-12-18 An accessible primer on how to create effective graphics from data This book provides students and researchers a hands-on introduction to the principles and practice of data visualization. It explains what makes some graphs succeed while others fail, how to make high-quality figures from data using powerful and reproducible methods, and how to think about data visualization in an honest and effective way. Data Visualization builds the reader’s expertise in ggplot2, a versatile visualization library for the R programming language. Through a series of worked examples, this accessible primer then demonstrates how to create plots piece by piece, beginning with summaries of single variables and moving on to more complex graphics. Topics include plotting continuous and categorical variables; layering information on graphics; producing effective “small multiple” plots; grouping, summarizing, and transforming data for plotting; creating maps; working with the output of statistical models; and refining plots to make them more comprehensible. Effective graphics are essential to communicating ideas and a great way to better understand data. This book provides the practical skills students and practitioners need to visualize quantitative data and get the most out of their research findings. Provides hands-on instruction using R and ggplot2 Shows how the “tidyverse” of data analysis tools makes working with R easier and more consistent Includes a library of data sets, code, and functions
  data science for social good: Artificial Intelligence and Social Work Milind Tambe, Eric Rice, 2018-11-29 An introductory guide with real-life examples on using AI to help homeless youth, diabetes patients, and other social welfare interventions.
  data science for social good: ECML PKDD 2018 Workshops Carlos Alzate, Anna Monreale, Haytham Assem, Albert Bifet, Teodora Sandra Buda, Bora Caglayan, Brett Drury, Eva García-Martín, Ricard Gavaldà, Irena Koprinska, Stefan Kramer, Niklas Lavesson, Michael Madden, Ian Molloy, Maria-Irina Nicolae, Mathieu Sinn, 2019-02-15 This book constitutes revised selected papers from the workshops Nemesis, UrbReas, SoGood, IWAISe, and Green Data Mining, held at the 18th European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2018, in Dublin, Ireland, in September 2018. The 20 papers presented in this volume were carefully reviewed and selected from a total of 32 submissions. The workshops included are: Nemesis 2018: First Workshop on Recent Advances in Adversarial Machine Learning UrbReas 2018: First International Workshop on Urban Reasoning from Complex Challenges in Cities SoGood 2018: Third Workshop on Data Science for Social Good IWAISe 2018: Second International Workshop on Artificial Intelligence in Security Green Data Mining 2018: First International Workshop on Energy Efficient Data Mining and Knowledge Discovery
Data Science for Social Good - Mendoza College of Business
Against this backdrop, we present a framework for “data science for social good” (DSSG) research that considers the interplay between relevant data science research genres, social …

SIADS_688_Data_Science_for_Social_Good_w21_Chen
This course analyzes the motivations and incentives for people to contribute to public goods. Students will learn how to apply causal inference techniques and social science theories to …

Big Data for Social Good Syllabus - Harvard Online
Big Data for Social Good will teach you how to use big data, coupled with the tools of data science and economics, to solve some of the most important social problems of our time.

Code and Data for the Social Sciences: A Practitioner’s Guide
This handbook is about translating insights from experts in code and data into practical terms for empirical social scientists. We are not ourselves software engineers, database managers, or …

Impact and Implementation of Data Science for Social Good …
Abstract: The focus of this study is on assessing the success factors of data science projects within , as technology laggards, and on projects on these organizations. Using the Dynamic …

The Data for Good Growth Map - escience.washington.edu
The Data Science for Social Good program at Carnegie Mellon University (formerly at the University of Chicago) has tackled a wide variety of social challenges in locations across the …

Microsoft Word - Tanweer_CSC_in_DSSG.docx - Data Science …
Based on three years of fieldwork with multi-sector collaborations in the space of data science for social good, we argue that there are five distinct contributions that motivate collaborations with …

JAIS Special Issue: Data Science for Social Good Editorial Articles
Champions for Social Good: How Can We Discover Social Sentiment and Attitude-Driven Patterns in Prosocial Communication? Raghava Mukkamala, Robert J. Kauffman, and Helle …

data science looks like Harnessing the potential of What data …
Using data science for social good requires impact-based projects that are built on adequate data . But above all, we need the expertise of social data scientists who ask the following questions:

14: DATA FOR THE SOCIAL GOOD: TOWARD A DATA …
ect, process and repurpose data to fuel research for the social good. Problematizing the mainstream connotations of big data, these examples of 'data activ-ism' take a critical stance …

Scaling Up Data Science for the Social Sciences
Jun 7, 2021 · As computational tools and resources become more prevalent and data sources increase the quantity and quality of human data, the social scientists of tomorrow must also be …

Data Science for Social Good - ResearchGate
Against this backdrop, we present a framework for “data science for social good” (DSSG) research that considers the interplay between relevant data science research genres, social...

Solve for Good: A Data Science for Social Good Marketplace
In discussions with hundreds of government agencies and non-profits, and in experiences running data science for social good programs, we have found that there are many volunteers who …

Unit 3 - Lesson 10 Data for Social Good Project Day 1
Data for Social Good Project: Day 1 Choose your user story. Brainstorm questions. Create UML diagrams. Plan your algorithms.

Data for Good Exchange 2019 Data for Good Exchange 2019: …
related to the social good. This forum has brought together participants from across academia, industry, government and NGOs, enabling them to share insights and progress on applying …

Big ideas, small data: Opportunities and challenges for data …
One such deployment is DS for Social Good, in which data science strategies are applied to social problems. DS for Social Good tackles an array of societal challenges such as algorithmic …

Open Platforms for Artificial Intelligence for Social Good: …
In this paper, we discuss how moving from demonstra-tions to true impact on humanity will require a different course of action, namely open plat-forms containing foundational AI capabilities to …

Small Data, Big Justice: The Intersection of Data Science, …
In this editorial, we aim to review the concept of Big Data from a critical viewpoint, then introduce a conceptual definition of Small Data, which we think, offers an antidote to the impersonal, …

Data Science as Political Action: Grounding Data Science in a …
In this article, I argue that data science must embrace a political orientation. Data scientists must recognize themselves as political actors engaged in normative constructions of society and …

Call for Papers—INFORMS Journal on Computing: Special …
To address social problems with an emphasis on human values and ethics, AI and data science solutions should be built based on five major principles: unbiased results, transparency, …

Data Science for Social Good - Mendoza College of Business
Against this backdrop, we present a framework for “data science for social good” (DSSG) research that considers the interplay between relevant data science research genres, social …

SIADS_688_Data_Science_for_Social_Good_w21_Chen
This course analyzes the motivations and incentives for people to contribute to public goods. Students will learn how to apply causal inference techniques and social science theories to …

Big Data for Social Good Syllabus - Harvard Online
Big Data for Social Good will teach you how to use big data, coupled with the tools of data science and economics, to solve some of the most important social problems of our time.

Code and Data for the Social Sciences: A Practitioner’s Guide
This handbook is about translating insights from experts in code and data into practical terms for empirical social scientists. We are not ourselves software engineers, database managers, or …

Impact and Implementation of Data Science for Social Good …
Abstract: The focus of this study is on assessing the success factors of data science projects within , as technology laggards, and on projects on these organizations. Using the Dynamic …

The Data for Good Growth Map - escience.washington.edu
The Data Science for Social Good program at Carnegie Mellon University (formerly at the University of Chicago) has tackled a wide variety of social challenges in locations across the …

Microsoft Word - Tanweer_CSC_in_DSSG.docx - Data …
Based on three years of fieldwork with multi-sector collaborations in the space of data science for social good, we argue that there are five distinct contributions that motivate collaborations with …

JAIS Special Issue: Data Science for Social Good Editorial …
Champions for Social Good: How Can We Discover Social Sentiment and Attitude-Driven Patterns in Prosocial Communication? Raghava Mukkamala, Robert J. Kauffman, and Helle …

data science looks like Harnessing the potential of What data …
Using data science for social good requires impact-based projects that are built on adequate data . But above all, we need the expertise of social data scientists who ask the following questions:

14: DATA FOR THE SOCIAL GOOD: TOWARD A DATA …
ect, process and repurpose data to fuel research for the social good. Problematizing the mainstream connotations of big data, these examples of 'data activ-ism' take a critical stance …

Scaling Up Data Science for the Social Sciences
Jun 7, 2021 · As computational tools and resources become more prevalent and data sources increase the quantity and quality of human data, the social scientists of tomorrow must also be …

Data Science for Social Good - ResearchGate
Against this backdrop, we present a framework for “data science for social good” (DSSG) research that considers the interplay between relevant data science research genres, social...

Solve for Good: A Data Science for Social Good Marketplace
In discussions with hundreds of government agencies and non-profits, and in experiences running data science for social good programs, we have found that there are many volunteers who …

Unit 3 - Lesson 10 Data for Social Good Project Day 1
Data for Social Good Project: Day 1 Choose your user story. Brainstorm questions. Create UML diagrams. Plan your algorithms.

Data for Good Exchange 2019 Data for Good Exchange …
related to the social good. This forum has brought together participants from across academia, industry, government and NGOs, enabling them to share insights and progress on applying …

Big ideas, small data: Opportunities and challenges for data …
One such deployment is DS for Social Good, in which data science strategies are applied to social problems. DS for Social Good tackles an array of societal challenges such as algorithmic …

Open Platforms for Artificial Intelligence for Social Good: …
In this paper, we discuss how moving from demonstra-tions to true impact on humanity will require a different course of action, namely open plat-forms containing foundational AI capabilities to …

Small Data, Big Justice: The Intersection of Data Science, …
In this editorial, we aim to review the concept of Big Data from a critical viewpoint, then introduce a conceptual definition of Small Data, which we think, offers an antidote to the impersonal, …

Data Science as Political Action: Grounding Data Science in …
In this article, I argue that data science must embrace a political orientation. Data scientists must recognize themselves as political actors engaged in normative constructions of society and …

Call for Papers—INFORMS Journal on Computing: Special …
To address social problems with an emphasis on human values and ethics, AI and data science solutions should be built based on five major principles: unbiased results, transparency, …