ApacheCon EU 2016 has ended
ApacheCon Europe 2016
Friday, November 18
 

11:00

Lucene And Solr Document Classification - Alessandro Benedetti, loveholidays.com
This presentation will start by introducing how Apache Lucene can be used to classify documents using data structures that already exist in your index instead of having to generate and supply external training sets.

Building on the introduction, the focus will be on the extensions to the Lucene Classification module that come in Lucene 6.0 and on the module's incorporation into Solr 6.1. These extensions allow you to classify at the document level, with individual field weighting, numeric field support, lat/lon fields, and so on.

The Solr ClassificationUpdateProcessor will then be explored: how it works and how to use it, including basic and advanced features such as multi-class support and classification context filtering.

The presentation will include practical examples and real-world use cases.

Speakers
Alessandro Benedetti

Senior Search Software Engineer, Sease Ltd
Alessandro Benedetti is a Search Consultant and R&D Software Engineer at Sease Ltd. His focus is on information retrieval, information extraction, natural language processing, and machine learning. At Sease, Alessandro works as a freelancer on search and machine learning projects and...



Friday November 18, 2016 11:00 - 11:50
Carmona

12:00

Building a Search Engine for the Cuban Web - Nicolas Malin & Julien Nicolas, Nereide
This talk will cover the transition of Solr from "just the inverted index for search" into the core technology of a web search engine for the Cuban Web. The main purpose is to show how some of the more common features of today's web search engines can be provided by Apache Solr, which makes Solr the heart of our system. The talk will cover integration with several Apache projects and how these systems work together to build a full-featured web search engine, an image search engine, and a real-time news search engine with alert capabilities, all powered by the features of Solr and those projects. It will also discuss the use of Solr itself to help monitor and run the different components of the system. In essence: how to build a web search engine using the power of the Apache Foundation.

Speakers
Jorge Betancourt Gonzalez

University of Informatic Sciences
Software engineer with more than 5 years of experience using Java. Has worked with search engines for over 3 years, especially Apache Solr. Has done some consultancy work in web crawling and NLP/text processing. Currently building a search engine for the Cuban Web. Interested...


Friday November 18, 2016 12:00 - 12:50
Carmona