This introductory book for text and web mining utilizes only open source tools to illustrate and teach unique use case applications. Each use case consists of a fully implemented application based on a real text/web mining example. Data and clear instructions to implement the use case are provided in each chapter. Readers will investigate at least ten different tools such as R, RapidMiner, Knime, Natural Language Toolkit, Open NLP, GATE, Gephi, and ManyEyes to learn text visualization and novel techniques.
Arvustused
"The timing of this book could not be better. It focuses on text mining, text being one of the data sources still to be truly harvested, and on open-source tools for the analysis and visualization of textual data. Markus and Andrew have done an outstanding job bringing together this volume of both introductory and advanced material about text mining using modern open-source technology in a highly accessible way." From the Foreword by Professor Dr. Michael Berthold, University of Konstanz, Germany
RapidMiner. KNIME. Python. R.
Markus Hofmann is a lecturer at the Institute of Technology Blanchardstown, where he focuses on the areas of data mining, text mining, data exploration and visualization, and business intelligence. Dr. Hofmann has also worked as a technology expert with 20 different organizations, such as Intel. He earned a PhD from Trinity College Dublin, an MSc in computing from the Dublin Institute of Technology, and a BA in information management systems.
Andrew Chisholm is a certified RapidMiner Master who created both basic and advanced RapidMiner video training content for RapidMinerResources.com. He has worked as a software developer, systems integrator, project manager, solution architect, customer-facing presales consultant, and strategic consultant. He earned an MSc in business intelligence and data mining from the Institute of Technology Blanchardstown and an MA in physics from Oxford University.