Gør som tusindvis af andre bogelskere
Tilmeld dig nyhedsbrevet og få gode tilbud og inspiration til din næste læsning.
Ved tilmelding accepterer du vores persondatapolitik.Du kan altid afmelde dig igen.
This volume LNCS-IFIP constitutes the refereed proceedings of the 7th IFIP TC 5, TC 12, WG 8.4, WG 8.9, WG 12.9 International Cross-Domain Conference, CD-MAKE 2023 in Benevento, Italy, during August 28 ¿ September 1, 2023. The 18 full papers presented together were carefully reviewed and selected from 30 submissions. The conference focuses on integrative machine learning approach, considering the importance of data science and visualization for the algorithmic pipeline with a strong emphasis on privacy, data protection, safety and security.
This book is a major update to the very successful first and second editions (2005 and 2010) of Data Mining and Knowledge Discovery Handbook. Since the last edition, this field has continued to evolve and to gain popularity. Existing methods are constantly being improved and new methods, applications and aspects are introduced. The new title of this handbook and its content reflect these changes thoroughly. Some existing chapters have been brought up to date. In addition to major revision of the existing chapters, the new edition includes totally new topics, such as: deep learning, explainable AI, human factors and social issues and advanced methods for big-data. The significant enhancement to the content reflects the growth in importance of data science. The third edition is also a timely opportunity to incorporate many other changes based on peers and students¿ feedback.This comprehensive handbook also presents a coherent and unified repository of data science major concepts, theories, methods, trends, challenges and applications. It covers all the crucial important machine learning methods used in data science.Today's accessibility and abundance of data make data science matters of considerable importance and necessity. Given the field's recent growth, it's not surprising that researchers and practitioners now have a wide range of methods and tools at their disposal. While statistics is fundamental for data science, methods originated from artificial intelligence, particularly machine learning, are also playing a significant role.This handbook aims to serve as the main reference for researchers in the fields of information technology, e-Commerce, information retrieval, data science, machine learning, data mining, databases and statistics as well as advanced level students studying computer science or electrical engineering. Practitioners working within these related fields and data scientists will also want to purchase this handbook as a reference.
If programming is magic then web scraping is surely a form of wizardry. By writing a simple automated program, you can query web servers, request data, and parse it to extract the information you need. The expanded edition of this practical book not only introduces you web scraping, but also serves as a comprehensive guide to scraping almost every type of data from the modern web.Part I focuses on web scraping mechanics: using Python to request information from a web server, performing basic handling of the servers response, and interacting with sites in an automated fashion. Part II explores a variety of more specific tools and applications to fit any web scraping scenario youre likely to encounter.Parse complicated HTML pagesDevelop crawlers with the Scrapy frameworkLearn methods to store data you scrapeRead and extract data from documentsClean and normalize badly formatted dataRead and write natural languagesCrawl through forms and loginsScrape JavaScript and crawl through APIsUse and write image-to-text softwareAvoid scraping traps and bot blockersUse scrapers to test your website
This book constitutes the proceedings of the 25th International Conference on Big Data Analytics and Knowledge Discovery, DaWaK 2023, which took place in Penang, Malaysia, during August 29-30, 2023. The 18 full papers presented together with 19 short papers were carefully reviewed and selected from a total of 83 submissions.They were organized in topical sections as follows: Data quality; advanced analytics and pattern discovery; machine learning; deep learning; and data management.
This book constitutes the refereed post-proceedings of the 14th International Conference on Computer Supported Education, CSEDU 2022, Virtual Event, April 22¿24, 2022. The conference was held virtually due to the COVID-19 crisis.The 8 full papers included in this book were carefully reviewed and selected from 181 submissions. The papers included in CSEDU 2022 proceedings contribute to the understanding of relevant trends of current research on Computer Supported Education, including: Emerging Technologies in Education for Sustainable Development, Instructional Design, Pre-K/K-12 Education, Machine Learning, Learning with AI Systems, Higher Order Thinking Skills, Game-Based and Simulation-Based Learning, Educational Data Mining, Course Design and eLearning Curriculae and Constructivism and Social Constructivism.
This book systemically presents key concepts of multi-modal hashing technology, recent advances on large-scale efficient multimedia search and recommendation, and recent achievements in multimedia indexing technology. With the explosive growth of multimedia contents, multimedia retrieval is currently facing unprecedented challenges in both storage cost and retrieval speed. The multi-modal hashing technique can project high-dimensional data into compact binary hash codes. With it, the most time-consuming semantic similarity computation during the multimedia retrieval process can be significantly accelerated with fast Hamming distance computation, and meanwhile the storage cost can be reduced greatly by the binary embedding. The authors introduce the categorization of existing multi-modal hashing methods according to various metrics and datasets. The authors also collect recent multi-modal hashing techniques and describe the motivation, objective formulations, and optimization steps for context-aware hashing methods based on the tag-semantics transfer.
This two-volume set LNAI 13995 and LNAI 13996 constitutes the refereed proceedings of the 15th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2023, held in Phuket, Thailand, during July 24-26, 2023.The 65 full papers presented in these proceedings were carefully reviewed and selected from 224 submissions. The papers of the 2 volume-set are organized in the following topical sections: Case-Based Reasoning and Machine Comprehension; Computer Vision; Data Mining and Machine Learning; Knowledge Integration and Analysis; Speech and Text Processing; and Resource Management and Optimization.
This book presents high-quality research papers presented at 2nd International Conference on Smart Data Intelligence (ICSMDI 2022) organized by Kongunadu College of Engineering and Technology at Trichy, Tamil Nadu, India, during April 2022. This book brings out the new advances and research results in the fields of algorithmic design, data analysis, and implementation on various real-time applications. It discusses many emerging related fields like big data, data science, artificial intelligence, machine learning, and deep learning which have deployed a paradigm shift in various data-driven approaches that tends to evolve new data-driven research opportunities in various influential domains like social networks, healthcare, information, and communication applications.
This book provides a practical and fairly comprehensive review of Data Science through the lens of dimensionality reduction, as well as hands-on techniques to tackle problems with data collected in the real world. State-of-the-art results and solutions from statistics, computer science and mathematics are explained from the point of view of a practitioner in any domain science, such as biology, cyber security, chemistry, sports science and many others. Quantitative and qualitative assessment methods are described to implement and validate the solutions back in the real world where the problems originated.The ability to generate, gather and store volumes of data in the order of tera- and exo bytes daily has far outpaced our ability to derive useful information with available computational resources for many domains.This book focuses on data science and problem definition, data cleansing, feature selection and extraction, statistical, geometric, information-theoretic, biomolecular and machine learning methods for dimensionality reduction of big datasets and problem solving, as well as a comparative assessment of solutions in a real-world setting.This book targets professionals working within related fields with an undergraduate degree in any science area, particularly quantitative. Readers should be able to follow examples in this book that introduce each method or technique. These motivating examples are followed by precise definitions of the technical concepts required and presentation of the results in general situations. These concepts require a degree of abstraction that can be followed by re-interpreting concepts like in the original example(s). Finally, each section closes with solutions to the original problem(s) afforded by these techniques, perhaps in various ways to compare and contrast dis/advantages to other solutions.
This book is intended as an introduction to a versatile model for temporal data. It exhibits an original lattice structure on the space of chronicles and proposes new counting approach for multiple occurrences of chronicle occurrences. This book also proposes a new approach for frequent temporal pattern mining using pattern structures. This book was initiated by the work of Ch. Dousson in the 1990¿s. At that time, the prominent format was Temporal Constraint Networks for which the article by Richter, Meiri and Pearl is seminal.Chronicles do not conflict with temporal constraint networks, they are closely related. Not only do they share a similar graphical representation, they also have in common a notion of constraints in the timed succession of events. However, chronicles are definitely oriented towards fairly specific tasks in handling temporal data, by making explicit certain aspects of temporal data such as repetitions of an event. The notion of chronicle has been applied both for situation recognition and temporal sequence abstraction. The first challenge benefits from the simple but expressive formalism to specify temporal behavior to match in a temporal sequence. The second challenge aims to abstract a collection of sequences by chronicles with the objective to extract characteristic behaviors.This book targets researchers and students in computer science (from logic to data science). Engineers who would like to develop algorithms based on temporal models will also find this book useful.
This monograph provides comprehensive coverage of the collection, management, and use of big data obtained from remote sensing. The book begins with an introduction to the basics of big data and remote sensing, laying the groundwork for the more specialized information to follow. The volume then goes on to address a wide variety of topics related to the use and management of remote sensing big data, including hot topics such as analysis through machine learning, cyberinfrastructure, and modeling. Examples on how to use the results of big data analysis of remotely sensed data for concrete decision-making are offered as well. The closing chapters discuss geospatial big data initiatives throughout the world and future challenges and opportunities for remote sensing big data applications. The audience for this book includes researchers at the intersection of geoscience and data science, senior undergraduate and graduate students, and anyone else interested in how large datasets obtained through remote sensing can be best utilized. The book presents a culmination of 30 years of research from renowned spatial scientists Drs. Liping Di and Eugene Yu.
This book offers an excellent source of knowledge for readers who are interested in keeping up with the developments in the field of cyber security and social media analysis. It covers the possibility of using augmented reality to monitor cyber security feeds in a multitasking environment. It describes a real-time scheduled security scanner. E-commerce concept labeling is tackled by introducing a lightweight global taxonomy induction system. Blogsphere analytics and online video narratives and networks are explored. The effect of global and local network structure, credibility based prevention of fake news dissemination, and detection of trending topics and influence from social media are investigated.This book helps the reader in developing their own perspective about how to deal with cyber security and how to benefit from the development in technology to tackle cyber security issues. The reader of this book will realize how to use various machine learning techniques for tackling various applications involving social medial analysis.
This book constitutes the proceedings of the 6th IFIP TC 12 International Conference on Computational Intelligence in Data Science, ICCIDS 2023, which took place in Kalavakkam, India, in February 2023.The 24 full papers presented in this volume were carefully reviewed and selected from 134 submissions. The major theme of the conference was intended to be computation intelligence and knowledge management. Various emerging areas like IoT, cyber security and data science need computation intelligence to align with the cutting-edge research. Machine learning delivers insights hidden in data for rapid, automated responses and improved decision making. Machine learning for IoT can be used to project future trends, detect anomalies, and augment intelligence by ingesting image, video, and audio.
This book presents Proceedings of the International Conference on Intelligent Systems and Networks (ICISN 2022), held at Hanoi in Vietnam. It includes peer reviewed high quality articles on Intelligent System and Networks. It brings together professionals and researchers in the area and presents a platform for exchange of ideas and to foster future collaboration. The topics covered in this book include- Foundations of Computer Science; Computational Intelligence Language and speech processing; Software Engineering Software development methods; Wireless Communications Signal Processing for Communications; Electronics track IoT and Sensor Systems Embedded Systems; etc.
This book introduces a novel perspective on machine learning, offering distinct advantages over neural network-based techniques. This approach boasts a reduced hardware requirement, lower power consumption, and enhanced interpretability. The applications of this approach encompass high-speed classifications, including packet classification, network intrusion detection, and exotic particle detection in high-energy physics. Moreover, it finds utility in medical diagnosis scenarios characterized by small training sets and imbalanced data. The resulting rule generated by this method can be implemented either in software or hardware. In the case of hardware implementation, circuit design can employ look-up tables (memory), rather than threshold gates.The methodology described in this book involves extracting a set of rules from a training set, composed of categorical variable vectors and their corresponding classes. Unnecessary variables are eliminated, and the rules are simplified before being transformed into a sum-of-products (SOP) form. The resulting SOP exhibits the ability to generalize and predict outputs for new inputs. The effectiveness of this approach is demonstrated through numerous examples and experimental results using the University of California-Irvine (UCI) dataset.This book is primarily intended for graduate students and researchers in the fields of logic synthesis, machine learning, and data mining. It assumes a foundational understanding of logic synthesis, while familiarity with linear algebra and statistics would be beneficial for readers.
This book introduces the Topological Weighed Centroid approach and describes some applications in the study of the dynamics of various spatial phenomena with a special emphasis on the spatial analysis of the relationship, influence, and dynamics of geographical phenomena. Offering a comprehensive introduction to the theory and illustrative examples from various kinds of geographical data, this book also takes an in-depth look at more complex case studies, such as the applications of the topological weighed centroid approach in the study of epidemic patterns, cultural processes, criminality, and environmental phenomena.
This book introduces the concept of ¿bespoke learning¿, a new mechanistic approach that makes it possible to generate values of an output variable at each designated value of an associated input variable. Here the output variable generally provides information about the system¿s behaviour/structure, and the aim is to learn the input-output relationship, even though little to no information on the output is available, as in multiple real-world problems. Once the output values have been bespoke-learnt, the originally-absent training set of input-output pairs becomes available, so that (supervised) learning of the sought inter-variable relation is then possible. Three ways of undertaking such bespoke learning are offered: by tapping into system dynamics in generic dynamical systems, to learn the function that causes the system¿s evolution; by comparing realisations of a random graph variable, given multivariate time series datasets of disparate temporal coverage; and by designing maximally information-availing likelihoods in static systems. These methodologies are applied to four different real-world problems: forecasting daily COVID-19 infection numbers; learning the gravitational mass density in a real galaxy; learning a sub-surface material density function; and predicting the risk of onset of a disease following bone marrow transplants. Primarily aimed at graduate and postgraduate students studying a field which includes facets of statistical learning, the book will also benefit experts working in a wide range of applications. The prerequisites are undergraduate level probability and stochastic processes, and preliminary ideas on Bayesian statistics.
This book is your practical and comprehensive guide to learning Google Cloud Platform (GCP) for data science, using only the free tier services offered by the platform.Data science and machine learning are increasingly becoming critical to businesses of all sizes, and the cloud provides a powerful platform for these applications. GCP offers a range of data science services that can be used to store, process, and analyze large datasets, and train and deploy machine learning models.The book is organized into seven chapters covering various topics such as GCP account setup, Google Colaboratory, Big Data and Machine Learning, Data Visualization and Business Intelligence, Data Processing and Transformation, Data Analytics and Storage, and Advanced Topics. Each chapter provides step-by-step instructions and examples illustrating how to use GCP services for data science and big data projects.Readers will learn how to set up a Google Colaboratory account and run Jupyternotebooks, access GCP services and data from Colaboratory, use BigQuery for data analytics, and deploy machine learning models using Vertex AI. The book also covers how to visualize data using Looker Data Studio, run data processing pipelines using Google Cloud Dataflow and Dataprep, and store data using Google Cloud Storage and SQL.What You Will LearnSet up a GCP account and projectExplore BigQuery and its use cases, including machine learningUnderstand Google Cloud AI Platform and its capabilities Use Vertex AI for training and deploying machine learning modelsExplore Google Cloud Dataproc and its use cases for big data processingCreate and share data visualizations and reports with Looker Data StudioExplore Google Cloud Dataflow and its use cases for batch and stream data processing Run data processing pipelines on Cloud DataflowExplore Google Cloud Storageand its use cases for data storage Get an introduction to Google Cloud SQL and its use cases for relational databases Get an introduction to Google Cloud Pub/Sub and its use cases for real-time data streamingWho This Book Is ForData scientists, machine learning engineers, and analysts who want to learn how to use Google Cloud Platform (GCP) for their data science and big data projects
Learn to write SQL queries to select and analyze data, and improve your ability to manipulate data. This book will help you take your existing skills to the next level.Author Mark Simon kicks things off with a quick review of basic SQL knowledge, followed by a demonstration of how efficient SQL databases are designed and how to extract just the right data from them. Yoüll then learn about each individual table¿s structure and how to work with the relationships between tables. As you progress through the book, you will learn more sophisticated techniques such as using common table expressions and subqueries, analyzing your data using aggregate and windowing functions, and how to save queries in the form of views and other methods. This book employs an accessible approach to work through a realistic sample, enabling you to learn concepts as they arise to improve parts of the database or to work with the data itself.After completing this book, you will have a more thorough understanding of database structure and how to use advanced techniques to extract, manage, and analyze data.What Will You LearnGain a stronger understanding of database design principles, especially individual tablesUnderstand the relationships between tablesUtilize techniques such as views, subqueries, common table expressions, and windowing functionsWho Is This Book For:SQL Databases users who want to improve their knowledge and techniques.
This volume constitutes poster papers and late breaking results presented during the 24th International Conference on Artificial Intelligence in Education, AIED 2023, Tokyo, Japan, July 3¿7, 2023.The 65 poster papers presented were carefully reviewed and selected from 311 submissions. This set of posters was complemented with the other poster contributions submitted for the Poster and Late Breaking results track of the AIED 2023 conference.
This book is the proceeding of the 1st International Conference on Distributed Sensing and Intelligent Systems (ICDSIS2020) which will be held in The National School of Applied Sciences of Agadir, Ibn Zohr University, Agadir, Morocco on February 01-03, 2020. ICDSIS2020 is co-organized by Computer Vision and Intelligent Systems Lab, University of North Texas, USA as a scientific collaboration event with The National School of Applied Sciences of Agadir, Ibn Zohr University. ICDSIS2020 aims to foster students, researchers, academicians and industry persons in the field of Computer and Information Science, Intelligent Systems, and Electronics and Communication Engineering in general. The volume collects contributions from leading experts around the globe with the latest insights on emerging topics, and includes reviews, surveys, and research chapters covering all aspects of distributed sensing and intelligent systems. The volume is divided into 5 key sections: Distributed Sensing Applications; Intelligent Systems; Advanced theories and algorithms in machine learning and data mining; Artificial intelligence and optimization, and application to Internet of Things (IoT); and Cybersecurity and Secure Distributed Systems. This conference proceeding is an academic book which can be read by students, analysts, policymakers, and regulators interested in Distributed Sensing, Smart Network approaches, Smart Cities, IoT Applications, and Intelligent Applications. It is written in plain and easy language, and describes new concepts when they appear first so that a reader without prior background of the field finds it readable. The book is primarily intended for research students in sensor networks and IoT applications (including intelligent information systems, and smart sensors applications), academics in higher education institutions including universities and vocational colleges, policy makers and legislators.
This book constitutes the refereed proceedings of the 24th International Conference on Artificial Intelligence in Education, AIED 2023, held in Tokyo, Japan, during July 3-7, 2023. This event took place in hybrid mode.The 53 full papers and 26 short papers presented in this book were carefully reviewed and selected from 311 submissions. The papers present result in high-quality research on intelligent systems and the cognitive sciences for the improvement and advancement of education. The conference was hosted by the prestigious International Artificial Intelligence in Education Society, a global association of researchers and academics specializing in the many fields that comprise AIED, including, but not limited to, computer science, learning sciences, and education.
This book constitutes the proceedings of the 17th International Conference on Formal Concept Analysis, ICFCA 2023, which took place in Kassel, Germany, in July 2023.The 13 full papers presented in this volume were carefully reviewed and selected from 19 submissions. The International Conference on Formal Concept Analysis serves as a platform for researchers from FCA and related disciplines to showcase and exchange their research findings. The papers are organized in two topical sections, first "Theory" and second "Applications and Visualization".
In der Informatik sind einige Fachleute der Meinung, dass das Mainframe Computing seine bedeutende Zeit berschritten hat. In den letzten 10 Jahren ist die Mainframe-Ausbildung weiter eingeschränkt worden, d.h., Vorlesungen für interessierte Studenten an deutschen Universitäten und Fachhochschulen existieren teilweise nur noch online im E-Learning. Das betrifft auch die dazugehörigen praktischen Übungen. Diese Entwicklung widerspricht insgesamt dem Bedarf von Großrechner-Fachpersonal in mittleren und großen Unternehmen. In diesem Bereich ist das Problem dadurch gewachsen, dass Mitarbeiter in den Ruhestand gewechselt sind und nicht mehr zur Verfügung stehen. In dem Buch Einführung in z/OS und OS/390 wurde bereits zu diesem Thema Stellung bezogen. Der Ist-Zustand von Mainframe-Fachpersonal auf dem IT-Arbeitsmarkt ist weiter geschrumpft. Die Universität Leipzig hat zusammen mit dem Institut für Informatik der Universitt Tübingen in der Vergangenheit versucht, diesen Prozess aufzuhalten. Es wurden Vorlesungen in Leipzig, Tübingen, Frankfurt (Frankfurt School of Finance & Management) zum Thema Mainframe Server gehalten in der Hoffnung, Studenten und Interessenten in das Mainframe Computing einzuführen. Das vorliegende Lehrbuch Mainframe System z Computing enthält wesentliche Änderungen gegenüber der 1. Auflage des Buches Einführung in z/OS und OS/390. Dazu zählen vordergründig Kapitel zur Hardware aktueller IBM-System-z-Server (z12, z15, z16). Es gibt insbesondere praktische Erkenntnisse und Erfahrungen zu den Themen DB2, IMS, CICS, WebSphere MQ, SMF und z/OS Connect EE. Das betrifft z.T. Nutzer mit normalen und auch solche mit Administrator-Rechten.
This book constitutes the refereed proceedings of the 21st International Conference on Artificial Intelligence in Medicine, AIME 2023, held in Portoroz, Slovenia, in June12¿15, 2023.The 23 full papers and 21 short papers presented together with 3 demonstration papers were selected from 108 submissions. The papers are grouped in topical sections on: machine learning and deep learning; explainability and transfer learning; natural language processing; image analysis and signal analysis; data analysis and statistical models; knowledge representation and decision support.
This book explains the ideas behind one of the most well-known methods for knowledge graph embedding of transformations to compute vector representations from a graph, known as RDF2vec. The authors describe its usage in practice, from reusing pre-trained knowledge graph embeddings to training tailored vectors for a knowledge graph at hand. They also demonstrate different extensions of RDF2vec and how they affect not only the downstream performance, but also the expressivity of the resulting vector representation, and analyze the resulting vector spaces and the semantic properties they encode.
This book presents a collection of essays written by leading researchers to honor Roman Slowinski's major scholarly interests and contributions. He is well-known for conducting extensive research on methodologies and techniques for intelligent decision support, where he combines operational research and artificial intelligence. The book reconstructs his main contributions, presents cutting-edge research and provides an outlook on the most promising and advanced domains of computer science and multiple criteria decision aiding. The respective chapters cover a wide range of related research areas, including decision sciences, ordinal data mining, preference learning and multiple criteria decision aiding, modeling of uncertainty and imprecision in decision problems, rough set theory, fuzzy set theory, multi-objective optimization, project scheduling and decision support applications. As such, the book will appeal to researchers and scholars in related fields.
Tilmeld dig nyhedsbrevet og få gode tilbud og inspiration til din næste læsning.
Ved tilmelding accepterer du vores persondatapolitik.