Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Complex html list extraction to xpath #291

Closed
trompx opened this issue Jul 7, 2023 · 2 comments
Closed

Complex html list extraction to xpath #291

trompx opened this issue Jul 7, 2023 · 2 comments

Comments

@trompx
Copy link

trompx commented Jul 7, 2023

Hello, thanks for creating dedoc. I am not sure of its complete capabilities. Does it allow to extract list of complex elements from html? Like, detecting patterns in the DOM structure and extract xpath of list of elements. For example, search results which are composed of multiples tags.

@oksidgy
Copy link
Collaborator

oksidgy commented Aug 28, 2023

No, dedoc cannot extract xpath from html format. But you can write own html-handler (your reader) with this functionality and do pull-request. You can read about readers here (https://dedoc.readthedocs.io/en/latest/modules/readers.html)

@NastyBoget
Copy link
Collaborator

The instruction how to add your own reader to dedoc can be found here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants