How can I integrate the Apache Solr Search with my Drupal 7 Site? - apache

Can anyone Give me the Good tutorial links which will be helpful to me so I can check that How to Integrate the Solr Search with my Drupal Site to get good performance.
What are the modules available for Drupal 7.x Version of Apache Solr Search.
Which version of Solr will support the Drupal 7.x.
What are the Configuration should required in Apache Solr / Drupal 7.x to Search?

There are two modules that support Solr with Drupal that are widely used:
Search API Solr
ApacheSolr search
Both have their various configuration 'quirks', I'd say you'd need to try both to see how they fit in with your site, to see which suits you best.
Make sure you have Java 5 or higher installed already on your server.
Tutorial on setting up site with Search API for Solr
Tutorial on setting up site with ApacheSolr

Look at and use Search API - Apache Solr module will not be the way forward for the future of Drupal.
The two maintainers of Search API and Apache Solr have meet in person and have determined a way forward for advanced searches with Drupal and they both have agreed that Search API is it.

Related

Full-text search on microsoft docs using Apache SolR

Does Apache Solr allow for full text search on Microsoft documents such as word or powerpoint? if so, where can I find a tutorial?
Yes. Solr uses Apache Tika for content extraction and support the majority of file types.
You'll need to configure a handler in your solrconfig.xml.
Here's a good starting documentation with examples:
https://lucene.apache.org/solr/guide/6_6/uploading-data-with-solr-cell-using-apache-tika.html

Any descriptive tutorials or clear guidance on crawling web by apache solr 6.6

I read on this question page that solr 5+ supports web crawling which means that we no longer need nutch. Are there any examples or descriptions to explain how to set up solr 6.6 crawl a set of remote websites?
they most probably meant using DIH with the right Datasource, but I doubt this can replace Nutch and such in many scenarios.

Viewing Apache solr logs on windows

I have drupal based site with solr integration. My localhost is on windows and the live site on Linux.
How do I enable and view solr logging for both setups? I can see a log folder in my localhost but its empty.
Just to elaborate, solr search etc works great in both setups. However I built a solr view that works perfectly on local but gives less accurate results on live. So I wanted to see the final solr queries being built to see the source of the difference.
While starting the Solr instance pass the following parameter to enable Solr logging to file.
-Djava.util.logging.config.file=etc/logging.properties
Then modify your /example/etc/logging.properties inside you Solr instance to customize your logging pattern.
Using Solr Version: Apache Solr 8.9.0
You could use the Solr Administration User Interface
Go to Solr Admin UI and click the link for "Logging".
Then you will see log info.
Selecting the Level link on the left, you see the hierarchy of classpaths and classnames for your instance.

What is the SOLR plugin for Liferay meant for?

I am using Liferay 6.1 and I am trying to learn how to incorporate search functionality into Liferay Portal. I was able to run Apache SOLR inside Liferay's tomcat container but I don't understand what the solr plugin for liferay is meant for.
Here is the link
Can someone please explain what are the benefits for using the plugin (for liferay) and what it accomplishes on top of using SOLR?
Thanks
Per this link it is to externalize the search function from the portal.
Using Solr instead of Lucene gives you the additional capabilities of Solr such as Replication, Sharding, Result clustering through Carrot2, Use of custom Analyzers/Stemmers etc.
It also can offload search server processing to a separate cluster.
Opens up the possibilities of search driven UI (facetted classification etc) separate from your portal UI.

Using Nutch crawler with Solr

Am I able to integrate Apache Nutch crawler with the Solr Index server?
Edit:
One of our devs came up with a solution from these posts
Running Nutch and Solr
Update for Running Nutch and Solr
Answer
Yes
If you're willing to upgrade to nutch 1.0 you can use the solrindex as described in this article by Lucid Imagination: http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/.
It's still an open issue. If you're feeling adventurous you could try applying those patches yourself, although it looks like it's not so simple
nutch 2.x is designed to use solr as default. You can follow the steps in http://wiki.apache.org/nutch/Nutch2Tutorial, or a better instruction in the book "Web Crawling and Data Mining with Apache Nutch".