Complex html list extraction to xpath #291

trompx · 2023-07-07T14:46:09Z

Hello, thanks for creating dedoc. I am not sure of its full capabilities. Can it extract lists of complex elements from HTML? That is, can it detect repeated patterns in the DOM structure and return the XPath of each element in the list? For example, search results, where each result is composed of multiple tags.

oksidgy · 2023-08-28T14:53:08Z

No, dedoc cannot extract XPath from HTML. However, you can write your own HTML handler (a custom reader) with this functionality and open a pull request. You can read about readers here: https://dedoc.readthedocs.io/en/latest/modules/readers.html
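
For illustration only, here is a rough sketch of the kind of logic such a handler might contain. It is not part of dedoc and does not use the dedoc reader API. It relies only on the Python standard library, and every name in it (`Node`, `TreeBuilder`, `find_repeated_items`, `min_repeats`) is hypothetical. The heuristic is an assumption too: siblings count as "list items" when they share the same tag and the same sequence of direct child tags. Real search-result pages would likely need a more tolerant comparison.

```python
from collections import defaultdict
from html.parser import HTMLParser

# Elements that never have a closing tag in HTML.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """Minimal DOM node: tag name, parent link, and element children."""

    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.children = []


class TreeBuilder(HTMLParser):
    """Builds a simple element tree (text nodes are ignored)."""

    def __init__(self):
        super().__init__()
        self.root = Node("[document]")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent


def xpath(node):
    """Absolute positional XPath, e.g. /html[1]/body[1]/div[2]."""
    parts = []
    while node.parent is not None:
        same_tag = [c for c in node.parent.children if c.tag == node.tag]
        index = next(i for i, c in enumerate(same_tag) if c is node) + 1
        parts.append(f"{node.tag}[{index}]")
        node = node.parent
    return "/" + "/".join(reversed(parts))


def signature(node):
    """Shallow structural fingerprint: own tag plus direct child tags."""
    return (node.tag, tuple(child.tag for child in node.children))


def find_repeated_items(html, min_repeats=3):
    """Return groups of XPaths for siblings that share one signature."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    results = []
    stack = [builder.root]
    while stack:
        parent = stack.pop()
        groups = defaultdict(list)
        for child in parent.children:
            if child.children:  # only "complex" elements with nested tags
                groups[signature(child)].append(child)
        for members in groups.values():
            if len(members) >= min_repeats:
                results.append([xpath(m) for m in members])
        stack.extend(parent.children)
    return results
```

Calling `find_repeated_items(page_source)` returns one list of XPaths per detected group. Lowering `min_repeats` makes the detection more permissive at the cost of more false positives.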

NastyBoget · 2023-10-06T09:54:41Z

Instructions for adding your own reader to dedoc can be found here

NastyBoget closed this as completed Oct 6, 2023
