Comparing two HTML pages?
Posted: 2014-01-31T12:36:57-07:00
Can I use imagemagick to compare two html pages as images and find the similarity %?
Basically I have lot of pages(which would be rendered with html), which needs to be compared with the ones provided screens(by user)
Examples
1. Question/answer page with 4 radio buttons --> Compare all text is same, all 4 radio buttons are vertical so if in other image they are horizontally placed, it should reduce the similarity etc
2. Page with multiple images and the page to be compared will have all the images. --> It should ignore minor noise, background differences etc
I was thinking of converting html page to image and then apply sift comparison algorithm
Any other ideas?
Basically I have lot of pages(which would be rendered with html), which needs to be compared with the ones provided screens(by user)
Examples
1. Question/answer page with 4 radio buttons --> Compare all text is same, all 4 radio buttons are vertical so if in other image they are horizontally placed, it should reduce the similarity etc
2. Page with multiple images and the page to be compared will have all the images. --> It should ignore minor noise, background differences etc
I was thinking of converting html page to image and then apply sift comparison algorithm
Any other ideas?