XPATH to extract data from CarWale.com?
With help from a friend I made a script to extract all specs and features from pages like http://www.carwale.com/mercedesbenz-cars/e-class/e63amg-3049/ , it works but not perfectly. He told me to use XPath //tr[contains (.,"FEATURE NAME")]/td, but one of them is impossible to pick, using //tr[contains (.,"Display")]/td it extract 4 features containing word Display. Is there any way to pick only the one labelled exactly Display? <td>Trip Meter</td><td>Multi-Function Display </td> <td>Heads Up Display (HUD)</td><td>No </td> <td>Display</td><td>LCD Display </td> <td>Display Screen for Rear Passengers</td><td>No </td> I also extracted car color names using XPath //div[#class='colorName'] I want also car color RGB values, or whole style code and remove unneeded code using find/replace, what XPath I need? <div class="colours" style="background-color: #040404; height: 30px; width: 130px; margin: 7px"></div>
Extract 'td' tag containing 'Display' if it's preceding sibling contains 'Display': //tr/td[contains(.,'Display')]/following-sibling::td[contains(.,'Display')] Extract RGB hex string: //div/substring-before(substring-after(#style,'background-color: '),';')
How can I use an XPATH to populate a drop-list in Sitecore WFM?
How to get both following-sibling::text() and following-sibling::b?
Scrapy xpath fail to find certain div in a webpage
Scrapy: Issues in dealing with Abbr tag in Xpath
XPath for ImportXML in Gsheet
XMLStarlet: selecting nodes using less than / greater than
Date range comparison in CQ using XPATH
I don't understand why this XPath expression is not working as a Scrapy selector
Selenium IDE, check (assert) if a dynamic element contains a specific text
xpath help to get buttons under a class where a link contains some href value
How to write xpath for below code displayed on Image
Fetch element child elements in XQuery
Quickly extract value using xpath
WSO2 ESB- Error Handling - On Error Sequence
Multiple xpath expressions
great ancestor & great great ancestor