We have developed a new way to track links for cases where our webnews algorithm does not manage to extract the data.
The following steps explain how to set up your monitoring:
1/ Try to add your Webnews page using our smartracker
2/ Add your link
In this specific case, notice that the system has found a webpage and not a webnews.
For more information regarding webpage and webnews, please read:
https://support.digimind.com/hc/en-us/articles/214263786-RSS-Web-News-or-Webpage
With our new weblinks monitoring system, you can do an advanced setup and choose the part of the page from which you want to extract the data.
3/ Click on the advanced settings link and choose add a webnews source
4/ Add your url and click on next
5/ Choose Extract all links from Xpath
1) By default, the system will crawl all the weblinks on the page (this option will probably produce noise)
2) Click on refresh to preview the weblinks the system will manage to extract (if there is no result, the system cannot extract the links from this page)
An advanced setup can be done by adjusting the XPath to choose exactly the area of the page you want to track
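To illustrate why narrowing the XPath reduces noise, here is a minimal Python sketch. It is not how Digimind extracts links internally; the sample page, the `en-bref` id and the `extract_links` helper are all hypothetical, and the snippet assumes well-formed markup so that the standard library parser can read it.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified page: a navigation menu plus a news block.
PAGE = """
<html>
  <body>
    <div id="menu">
      <a href="/home">Home</a>
      <a href="/contact">Contact</a>
    </div>
    <div id="en-bref">
      <ul>
        <li><a href="/news/1">First story</a></li>
        <li><a href="/news/2">Second story</a></li>
      </ul>
    </div>
  </body>
</html>
"""

def extract_links(page, xpath):
    """Return the href of every <a> found under the nodes matched by xpath."""
    root = ET.fromstring(page)
    links = []
    for node in root.findall(xpath):
        for anchor in node.iter("a"):
            href = anchor.get("href")
            if href:
                links.append(href)
    return links

# "." matches the whole page, similar to the default setting: menu links are included.
all_links = extract_links(PAGE, ".")

# Restricting the XPath to one block keeps only the links inside that block.
scoped_links = extract_links(PAGE, ".//div[@id='en-bref']")

print(all_links)
print(scoped_links)
```

In this sketch the whole-page query also returns the menu links, while the scoped query returns only the two news links.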
How to find the right XPath?
Let’s take the example of the following website: https://www.lejournaldesentreprises.com/lyon-saint-etienne-grenoble
Imagine you want to track the EN BREF area
Put your mouse on this area, then right-click and choose “inspect element”
The web console will open at the bottom of your page
Navigate through the inspector panel until the area you want to track is highlighted (in blue in my example)
Then right-click and choose copy XPath
Go back to Digimind and paste the XPath in the Extract all links from XPath area
Click on refresh to obtain a preview
If you are satisfied with the result, click on track to add this source to your monitoring.
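If you want to check a copied XPath outside Digimind before pasting it, the following Python sketch mimics the preview step under some assumptions: the sample markup, the `en-bref` id, the `https://www.example.com` base URL and the `preview_links` helper are hypothetical, and the page is assumed to be well-formed. Browsers usually copy XPaths that start with `//`, which the standard library only accepts when prefixed with a dot, so the helper adds it.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urljoin

# Hypothetical block of a page, with one link repeated on purpose.
SAMPLE = """
<html>
  <body>
    <div id="en-bref">
      <a href="/news/1">First story</a>
      <a href="/news/2">Second story</a>
      <a href="/news/1">First story (image link)</a>
    </div>
  </body>
</html>
"""

def preview_links(page, copied_xpath, base_url):
    """Return absolute, de-duplicated link URLs found under copied_xpath."""
    # ElementTree needs a relative path, so "//..." becomes ".//...".
    xpath = "." + copied_xpath if copied_xpath.startswith("//") else copied_xpath
    root = ET.fromstring(page)
    urls = []
    for node in root.findall(xpath):
        for anchor in node.iter("a"):
            href = anchor.get("href")
            if not href:
                continue
            url = urljoin(base_url, href)  # turn "/news/1" into a full URL
            if url not in urls:
                urls.append(url)
    return urls

found = preview_links(SAMPLE, '//*[@id="en-bref"]', "https://www.example.com/section")
missing = preview_links(SAMPLE, '//*[@id="does-not-exist"]', "https://www.example.com/section")

print(found)
print(missing)  # an empty list corresponds to an empty preview
```

An empty list here plays the same role as an empty preview: the XPath does not match any area containing links, so it needs to be adjusted.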