DCF Results - Similar News
When working with results from the Digimind Content Factory (DCF), you will often see similar news.
DCF results page:
DCF Portlet on a dashboard:
The results from the DCF are clustered together: news stories that contain 80% similar text (based on keywords, content and title) are considered similar.
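To make the 80% threshold concrete, here is a minimal sketch of a similarity check. It is only an illustration: the way Digimind actually scores similarity is not described here, so the sketch swaps in a simple word-overlap (Jaccard) measure, and the names `tokens`, `similarity`, `is_similar` and `SIMILARITY_THRESHOLD` are hypothetical.

```python
import re

# Assumption: the 80% figure from the text, applied to a simple word-overlap score.
SIMILARITY_THRESHOLD = 0.8


def tokens(text: str) -> set[str]:
    """Lowercase word set; a crude stand-in for real keyword extraction."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def similarity(a: str, b: str) -> float:
    """Jaccard overlap of the two word sets, between 0.0 and 1.0."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 1.0  # two empty texts are treated as identical
    return len(ta & tb) / len(ta | tb)


def is_similar(a: str, b: str) -> bool:
    """True when the two texts reach the similarity threshold."""
    return similarity(a, b) >= SIMILARITY_THRESHOLD
```

With this measure, two stories that share six of seven distinct words score about 0.86 and would be clustered, while stories with no words in common score 0.0 and would stay separate.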
Most of the time, when you are looking for information, you are not interested in seeing similar stories because you are looking for facts, key dates, opinions and other types of information. In this case, the clustering helps you work faster by keeping the DCF results page free of similar stories.
However, at times you might be very interested in knowing how long a story stayed alive on the internet, where it spread, through which sites, etc. Clicking the similar news link opens these stories so that you can check the websites and the title of each story, then click to read them individually if need be.
The headline story is presented on the results page of the DCF or in a DCF portlet on a dashboard. It is the first story picked up by Digimind; all the other stories were collected later and are clustered behind it.
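The headline rule can be sketched in a few lines. This is an illustration under stated assumptions, not Digimind's implementation: the `Story` record, its fields and the `split_cluster` helper are hypothetical names, and the stories passed in are assumed to already belong to one cluster.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Story:
    # Hypothetical record; field names are illustrative only.
    title: str
    site: str
    collected_at: datetime


def split_cluster(stories: list[Story]) -> tuple[Story, list[Story]]:
    """Return (headline, similar_news) for one cluster of similar stories.

    The headline is the story collected first; the rest follow in
    collection order, which also shows how the story spread over time.
    """
    if not stories:
        raise ValueError("a cluster needs at least one story")
    ordered = sorted(stories, key=lambda s: s.collected_at)
    return ordered[0], ordered[1:]
```

Sorting by collection time keeps the later stories in order, so the first and last entries give a rough idea of how long the story stayed alive and on which sites it appeared.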