scone.robot
Interface PageFilter
public interface PageFilter
This interface defines the functionaltity of a page filter.
- Author:
- Frank Wollenweber
filter
boolean filter(RobotHtmlNode robotHtmlNode,
QueueEntry qe)
- When a page is loaded the filter method is called for the page.
Filter gets a robotHtmlNode and can use the getAttribute method to get the attributes attached to the robotHtmlNode.
This method decides, if the crawling should be continued or not. The values of all filters are combined with the and operator
- Parameters:
robotHtmlNode
- to filterqe
- The entry of the queue which causes the robot to download robotHtmlNode. This parameter gives the filter access to the RobotTask and the current scanning-depth.
- Returns:
- True, if the crawling should be continued.