solr

How to Monitor Solr servers and Solr cluster

Solr Monitor is a tool developed by one of the folks at Quikr and published under an open source licence. It is currently the only tool available in the open source community for monitoring Solr servers. Solr is an open source enterprise search service from the Lucene project. Solr is written in Java and(…)

Solr Replication – High CPU Load – Frequent Commits

We are running a classified site, and the latest ads coming into the system are indexed on the Solr write server every three minutes. This index is then replicated to the Solr slave servers. Over a period of time the Solr index size grew to 40 GB. Now we have noticed that when(…)

How to Monitor Apache Solr Servers

There will come a point when you really want to monitor your Solr servers, because you may be wondering what is going on inside them. Solr is an open-source search server based on the Lucene Java search library, which is used by many sites to store data as well as handle(…)

What happens when I commit or optimize in Solr

When a commit/optimize is done on master, ReplicationHandler reads the list of file names which are associated with each commit point. This relies on the ‘replicateAfter’ parameter in the configuration to decide when these file names are to be fetched and stored from Lucene.

How does the Solr slave replicate?

The master is totally unaware of the slaves. The slave keeps polling the master (at a frequency set by the ‘pollInterval’ parameter) to check the current index version of the master. If the slave finds that the master has a newer version of the index, it initiates a replication process. The steps are as follows: the slave issues a filelist(…)
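The polling decision described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Solr's actual implementation: `needs_replication`, `poll_once`, and the two callables passed in are hypothetical names, and the index versions are plain integers standing in for whatever the master reports.

```python
from typing import Callable


def needs_replication(master_version: int, slave_version: int) -> bool:
    """Return True when the master reports a newer index version than the slave holds."""
    return master_version > slave_version


def poll_once(
    fetch_master_version: Callable[[], int],
    slave_version: int,
    start_replication: Callable[[int], None],
) -> bool:
    """Run one poll cycle; in a real slave this would repeat every pollInterval.

    fetch_master_version and start_replication are hypothetical hooks: the
    first asks the master for its current index version, the second begins
    fetching the files for that version.
    """
    master_version = fetch_master_version()
    if needs_replication(master_version, slave_version):
        start_replication(master_version)
        return True
    return False
```

Passing the network calls in as functions keeps the comparison logic testable without a running Solr instance. The master never appears as an active party here, which mirrors the point that it is unaware of its slaves.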

Solr Cache hit rate

Monitor the cache statistics from Solr’s admin interface! Raising Solr’s cache size is often the best way to improve performance, especially if you notice many evictions for a particular cache type. Pay particular attention to the filterCache, which is also used internally by Solr for faceting.
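As a rough illustration of how these statistics can be read, the sketch below computes a hit ratio from the lookups and hits counters and flags a cache that evicts heavily. The `CacheStats` class, the function names, and both thresholds are assumptions made for this example; they are not Solr defaults, and the numbers would normally be copied from the statistics page.

```python
from dataclasses import dataclass


@dataclass
class CacheStats:
    """Counters for one cache, as read off the statistics page."""
    name: str
    lookups: int
    hits: int
    evictions: int


def hit_ratio(stats: CacheStats) -> float:
    """Fraction of lookups answered from the cache (0.0 when there were no lookups)."""
    if stats.lookups == 0:
        return 0.0
    return stats.hits / stats.lookups


def needs_larger_cache(
    stats: CacheStats,
    min_hit_ratio: float = 0.8,       # hypothetical threshold, tune per workload
    max_eviction_rate: float = 0.1,   # hypothetical threshold, tune per workload
) -> bool:
    """Flag a cache whose hit ratio is low while it is evicting many entries."""
    if stats.lookups == 0:
        return False
    eviction_rate = stats.evictions / stats.lookups
    return hit_ratio(stats) < min_hit_ratio and eviction_rate > max_eviction_rate
```

For example, a filterCache with 1000 lookups, 600 hits, and 300 evictions has a hit ratio of 0.6 and would be flagged under these thresholds, which suggests trying a larger size for that cache.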

Solr Cache autoWarm Count Considerations

When a new searcher is opened, its caches may be prepopulated or “autowarmed” with cached objects from the caches in the old searcher. autowarmCount is the number of cached items that will be copied into the new searcher. You will probably want to base the autowarmCount setting on how long it takes to autowarm. You must consider(…)

Solr HashDocSet Max Size Considerations

The hashDocSet is an optimization specified in the solrconfig.xml that enables an int hash representation for filters (docSets) when the number of items in the set is less than maxSize. For smaller sets, this representation is more memory efficient, more efficient to iterate, and faster to take intersections. The hashDocSet max size should be based(…)

Apache solr Commit and optimize

Data sent to Solr is not immediately searchable, nor do deletions take immediate effect. Like a database, changes must be committed first. Unlike a database, there are no distinct sessions (that is, transactions) between each client, and instead there is in effect one global modification state. This means that if more than one Solr client were(…)

How to Verify Solr Replication

How to Validate Solr Data. Once you have set up replication on Solr, the next step is to verify it. The verification process is very simple. It is very important to verify the replication and validate the Solr index data. You can do it by accessing the master and slave in your web(…)
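One way to make such a check repeatable is to compare the values each server reports, as in the minimal sketch below. It assumes you have already collected an index version and a generation number from the master and from the slave and placed them in dictionaries; the function name and the dictionary keys `indexversion` and `generation` are choices made for this example.

```python
def replication_in_sync(master: dict, slave: dict) -> bool:
    """Return True when both servers report the same index version and generation.

    Both arguments are assumed to be pre-parsed dictionaries holding the
    values read from each server; a missing value counts as out of sync.
    """
    keys = ("indexversion", "generation")
    return all(
        master.get(key) is not None and master.get(key) == slave.get(key)
        for key in keys
    )
```

If the function returns False shortly after a commit on the master, the slave may simply not have polled yet, so it is worth checking again after the poll interval has passed.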

How does the Solr slave replicate

The master is totally unaware of the slaves. The slave keeps polling the master (at a frequency set by the ‘pollInterval’ parameter) to check the current index version of the master. If the slave finds that the master has a newer version of the index, it initiates a replication process. The steps are as follows: the slave issues(…)