Working with Enterprise Search Relevancy Challenges

When enterprise search is built from scratch, evaluating search quality remains a key challenge for the organizations implementing it. It can feel like working in the dark. Such implementations demand enormous effort and time. The chart below shows a typical situation: organizations invest and work consistently on maturing search quality over time, and yet remain far from satisfied.


SOLR Security with ManifoldCF

This article explains how to implement SOLR "document level security" using the Manifold Connector Framework (ManifoldCF). ManifoldCF is an open source framework for pulling content out of a repository and sending it on to targets such as SOLR through a plug-in style, connector-based architecture.


Building Docker image with Solr

There are two ways to build a Docker image:
  1. Running an image, modifying it, and committing the result. This requires access to a live container.
  2. Writing a Dockerfile and building the image from it.
Let's take an example of creating a Docker image with Solr from scratch.


Getting Started With Docker

According to Docker's website, "Docker is an open platform for developers and sysadmins to build, ship, and run distributed applications."
In simple words, it is one way to run and deploy your software application. Docker allows you to create lightweight "virtual machines"; these lightweight virtual machines are simply Docker containers.


Using Solr and TikaOCR to search text inside an image

Tesseract is probably the most accurate open source OCR engine available, and with Apache Tika 1.7 you can now use the Tesseract OCR parser within Tika. Solr 5.x supports Tika 1.7. I wanted to try this in Solr 5.2, so I configured it on my machine. Below are the steps required to make TikaOCR work with Solr 5.2.


Solr: Backed up? Now you can restore it back

One of the lesser-known but cool features of ReplicationHandler is its support for index backup. You have probably used ReplicationHandler in your project to replicate the index from master to slave instances. If you want to take a backup of the index, you can do it as follows:


Ontologies Vs Taxonomies Vs Thesauri, and its place on the Semantic Web

An ontology formally defines a common set of terms used to describe and represent a domain. It is domain specific: it describes and represents one area of knowledge. It contains terms and the relationships among those terms. A further level of relationship is expressed through a special group of terms: properties.


Solr Optimistic Concurrency Unlocked!

If you have multiple clients updating documents, it is critical to ensure that a newer version of a document is never overwritten by an older version. To address this problem, you need concurrency control, which is the process of managing simultaneous updates to documents.
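
To make the idea concrete, here is a minimal Python sketch of version-checked updates. It is a toy in-memory store, not Solr itself: the names `VersionedStore` and `VersionConflict` are hypothetical, and the rules it applies are modeled on how Solr treats the `_version_` field on an update (a value greater than 1 must match the stored version exactly, 1 means the document must already exist, a negative value means it must not exist, and 0 skips the check). Real Solr answers a failed check with an HTTP 409 Conflict; the sketch raises an exception instead.

```python
class VersionConflict(Exception):
    """Raised when an update's expected version does not match the store."""


class VersionedStore:
    """Toy in-memory store (hypothetical) mimicking Solr-style version checks."""

    def __init__(self):
        self._docs = {}   # doc id -> (version, fields)
        self._clock = 1   # assigned versions start at 2, so they never equal 0 or 1

    def get(self, doc_id):
        """Return (version, fields) for a document, or None if it is absent."""
        return self._docs.get(doc_id)

    def update(self, doc_id, fields, version=0):
        """Write a document if the expected version is satisfied.

        version > 1  : stored version must match exactly
        version == 1 : document must already exist
        version < 0  : document must not exist yet
        version == 0 : no check, last write wins
        Returns the newly assigned version.
        """
        current = self._docs.get(doc_id)
        if version > 1:
            if current is None or current[0] != version:
                raise VersionConflict(f"{doc_id}: expected version {version}")
        elif version == 1:
            if current is None:
                raise VersionConflict(f"{doc_id}: document does not exist")
        elif version < 0:
            if current is not None:
                raise VersionConflict(f"{doc_id}: document already exists")

        self._clock += 1
        self._docs[doc_id] = (self._clock, dict(fields))
        return self._clock


if __name__ == "__main__":
    store = VersionedStore()
    created = store.update("doc1", {"title": "first draft"}, version=-1)
    store.update("doc1", {"title": "second draft"}, version=created)
    try:
        # A slower client still holds the old version and must be rejected.
        store.update("doc1", {"title": "stale write"}, version=created)
    except VersionConflict as err:
        print("rejected:", err)
```

A client that receives a conflict would typically re-read the document, reapply its change on top of the latest version, and retry.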

