Applied text analysis with Python : enabling language-aware data products with machine learning / Benjamin Bengfort, Rebecca Bilbro, and Tony Ojeda.
By: Bengfort, Benjamin [author.].
Contributor(s): Bilbro, Rebecca [author.] | Ojeda, Tony [author.].
Publisher: Sebastopol, CA : O'Reilly Media, Inc., ©2018Copyright date: ©2018Edition: First edition.Description: 310 p: illustrations ; 25 cm.Content type: text | still image Media type: unmediated Carrier type: volumeISBN: 1491963042; 9781491963043.Subject(s): Natural language processing (Computer science) | Python (Computer program language) | Machine learning | Machine learning | Natural language processing (Computer science) | Python (Computer program language)Genre/Form: Print books.Current location | Call number | Status | Date due | Barcode | Item holds |
---|---|---|---|---|---|
On Shelf | QA76.73.P98 B454 2018 (Browse shelf) | Available | AU00000000016547 |
Browsing Alfaisal University Shelves , Shelving location: On Shelf Close shelf browser
QA76.73 .P98 2022 Machine learning with Pytorch and Sscikit-Learn : develop machine learning and deep learning models with scikit-learn and PyTorch | QA76.73 .P98 2023 Beginning Programming with Python For Dummies, / | QA76.73 .P98 2023 Publishing Python packages : test, share, and automate your projects / | QA76.73.P98 B454 2018 Applied text analysis with Python : enabling language-aware data products with machine learning / | QA76.73.P98 C465 2021 Deep learning with Python / | QA76.73.P98 D45 2020 Intro to Python for the computer and data sciences : learning to program with AI, big data and the cloud / | QA76.73.P98 D45 2020 Intro to Python for computer science and data science: learning to program with AI, big data, and the cloud |
Includes index.
Language and computation -- Building a custom corpus -- Corpus preprocessing and wrangling -- Text vectorization and transformation pipelines -- Classification for text analysis -- Clustering for text similarity -- Context-aware text analysis -- Text visualization -- Graph analysis of text -- Chatbots -- Scaling text analytics with multiprocessing and Spark -- Deep learning and beyond.
From news and speeches to informal chatter on social media, natural language is one of the richest and most underutilized sources of data. Not only does it come in a constant stream, always changing and adapting in context; it also contains information that is not conveyed by traditional data sources. The key to unlocking natural language is through the creative application of text analytics. This practical book presents a data scientist's approach to building language-aware products with applied machine learning. You will learn robust, repeatable, and scalable techniques for text analysis with Python, including contextual and linguistic feature engineering, vectorization, classification, topic modeling, entity resolution, graph analysis, and visual steering. By the end of the book, you'll be equipped with practical methods to solve any number of complex real-world problems.- Preprocess and vectorize text into high-dimensional feature representations - Perform document classification and topic modeling - Steer the model selection process with visual diagnostics - Extract key phrases, named entities, and graph structures to reason about data in text - Build a dialog framework to enable chatbots and language-driven interaction - Use Spark to scale processing power and neural networks to scale model complexity.--