Loading…
ApacheCon EU 2016 has ended
ApacheCon Europe 2016
Click here to Register or for more information 
Back To Schedule
Friday, November 18 • 11:00 - 11:50
Lucene And Solr Document Classification - Alessandro Benedetti, loveholidays.com

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

This presentation will start by introducing how Apache Lucene can be used to classify documents using data structures that already exist in your index instead of having to generate and supply external training sets.

Building on the introduction the focus will be on extensions of the Lucene Classification module that come in Lucene 6.0 and the Lucene Classification module's incorporation in to Solr 6.1. These extensions will allow you to classify at a document level with individual field weighting, numeric field support, lat/lon fields etc.

The Solr ClassificationUpdateProcessor will be explored, such as how it works, and how to use it including basic and advanced features like multi class support and classification context filtering.

The presentation will include practical examples and real world use cases.

Speakers
avatar for Alessandro Benedetti

Alessandro Benedetti

Senior Search Software Engineer, Sease Ltd
Alessandro Benedetti is a Search Consultant and R&D Software Engineer at Sease Ltd. His focus is on information retrieval, information extraction, natural language processing, and machine learning. At Sease Alessandro is working as a freelance on Search/Machine learning projects and... Read More →



Friday November 18, 2016 11:00 - 11:50 CET
Carmona