solr
Solr Replication – High CPU Load – Frequent Commits
we are running a classified site and the latest ads which are coming to the system is indexed in solr write server in every three minutes. Then this index is replicated to solr salve servers, Over a period of time the solr index file size increased to 40 GB . Now we have noticed that when [...]
How to Monitor Apache Solr Servers
There will be one point of time when you really want to monitor your Solr servers. As you may be wondering that what is going inside the solr servers. Solr is an open-source search server based on the Lucene Java search library  which  is used by many sites to store data as well as handle [...]
What happens when I commit or optimize in Solr
When a commit/optimize is done on master, ReplicationHandler reads the list of file names which are associated with each commit point. This relies on the ‘replicateAfter’ parameter in the configuration to decide when these file names are to be fetched and stored from Lucene.
How does the Solr slave replicate?
The master is totally unaware of the slaves. The slave continuously keeps polling the master (depending on the ‘pollInterval’ parameter) to check the current index version the master. If the slave finds out that the master has a newer version of the index it initiates a  The steps are as follows, Slave issues a filelist [...]
Solr Cache hit rate
Monitor the cache statistics from Solr’s admin! Raising Solr’s cache size is often the best way to improve performance, especially if you notice many evictions for a particular cache type. Pay particular attention to the filterCache, which is also used internally by Solr for facetting
Solr Cache autoWarm Count Considerations
When a new searcher is opened, its caches may be prepopulated or “autowarmed” with cached object from caches in the old searcher. autowarmCount is the number of cached items that will be copied into the new searcher. You will proably want to base the autowarmCount setting on how long it takes to autowarm. You must consider [...]
Solr indexing – Master/Slave replication, how to handle huge index and high traffic
- Does Solr have some limitation in size for its index? – Is there any max size when replicating segments from master to slave?
Solr HashDocSet Max Size Considerations
The hashDocSet is an optimization specified in the solrconfig.xml that enables an int hash representation for filters (docSets) when the number of items in the set is less than maxSize. For smaller sets, this representation is more memory efficient, more efficient to iterate, and faster to take intersections. The hashDocSet max size should be based [...]
Apache solr Commit and optimize
Data sent to Solr is not immediately searchable, nor do deletions take immediate effect. Like a database, changes must be committed first. Unlike a database, there are no distinct sessions (that is transactions) between each client, and instead there is in-effect one global modification state. This means that if more than one Solr client were [...]
How to Verify Solr Replication
How to Validate Solr Data Once you have set the Replication on Solr. The next step is to verify the Apache Solr Replication. The verify process is very simple. Its is very importent to verify the replication and validate Solr Index Data. You can do it by accessing the master and slave in your web [...]
How does the Solr slave replicate
The master is totally unaware of the slaves. The slave continuously keeps polling the master (depending on the ‘pollInterval’ parameter) to check the current index version the master. If the slave finds out that the master has a newer version of the index it initiates a replication process. The steps are as follows, slave issues [...]
How to Set up a Repeater for Solr Replication
A master may be able to serve only so many slaves without affecting performance. Some organizations have deployed slave servers across multiple data centers. If each slave downloads the index from a remote data center, the resulting download may consume too much network bandwidth. To avoid performance degradation in cases like this, you can configure [...]
How to Configure Apache Solr Replication
It is very surpricing to see that there are only few means very few details about the replication of solr server. So i desided to give a simple howtos on solr replication . Solr is an open source enterprise search server based on the Lucene Java search library, with XML/HTTP and JSON APIs, hit highlighting, [...]
