Skip to content

Xpath lxml python wrapper #903

@siwelwerd

Description

@siwelwerd

Previously, we used a command line tool called xpath to find, for example, all <activity>s that had a <answer> but no <statement> (#763), see also #512. This command line implementation had a perl dependency, and thus is not present in the Docker image we are using. @mitchkeller shared with me a couple scripts that use a python implementation of Xpath to do exactly this task (in this case, find all <image>s lacking a <description>; see #898), so they should run just fine in the Docker image.

I think I want to generalize Mitch's second script slightly since we have found other use cases for this previously, but it should not be hard to pass a couple things as a command-line argument and feed those into xpath within the python script.

Metadata

Metadata

Assignees

Labels

technicalRequires modifying project's technology infrastructure or non-trivial programming

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions