BaumGeist@lemmy.ml to Programmer Humor@lemmy.ml · 2 months agoThe best answer on StackOverflow: Using RegEx to parse HTMLstackoverflow.comexternal-linkmessage-square36fedilinkarrow-up1313arrow-down112
arrow-up1301arrow-down1external-linkThe best answer on StackOverflow: Using RegEx to parse HTMLstackoverflow.comBaumGeist@lemmy.ml to Programmer Humor@lemmy.ml · 2 months agomessage-square36fedilink
minus-squareschnurrito@discuss.tchncs.delinkfedilinkarrow-up1arrow-down2·2 months ago??? Non sequitur
minus-squaremoriquende@lemmy.worldlinkfedilinkarrow-up5·2 months agoYou can’t parse every html opening tag with regex, because a html opening tag doesn’t have a set structure. How would you match, with regex, this opening tag? <mytag myattribute="<value of \"myattribute\">" >
minus-squareschnurrito@discuss.tchncs.delinkfedilinkarrow-up1arrow-down1·edit-22 months agoIs this valid HTML? My understanding is that that attribute value needs to be escaped, i.e. <value of \"myattribute\">.
minus-squaremoriquende@lemmy.worldlinkfedilinkarrow-up4·2 months agoThe quote must not be escaped when you start with a single quote. The rest doesn’t. This is valid and tested: <img alt='my "<img>"'>
??? Non sequitur
You can’t parse every html opening tag with regex, because a html opening tag doesn’t have a set structure. How would you match, with regex, this opening tag?
<mytag myattribute="<value of \"myattribute\">" >
Is this valid HTML? My understanding is that that attribute value needs to be escaped, i.e.
<value of \"myattribute\">
.The quote must not be escaped when you start with a single quote. The rest doesn’t. This is valid and tested:
<img alt='my "<img>"'>