RegEx match open tags except XHTML self-contained tags
stackoverflow.com
external-link
I need to match all of these opening tags: <p> <a href="foo"> But not self-closing tags: <br /> <hr class="foo" /> I came up with this and wanted to make

no, this is one of the worst answers on Stack Overflow

OP had a specific question to capture opening tags. The thing OP asked about can be done with regular expressions. It is true that arbitrarily nested languages like HTML cannot generally be parsed with regular expressions, but that is not what OP asked about.

It can’t be done, as an opening tag in html can contain anything in its attributes, even JavaScript (e.g. onclick handler).

??? Non sequitur

You can’t parse every html opening tag with regex, because a html opening tag doesn’t have a set structure. How would you match, with regex, this opening tag? <mytag myattribute="<value of \"myattribute\">" >

Is this valid HTML? My understanding is that that attribute value needs to be escaped, i.e &lt;value of \&quot;myattribute\&quot;&gt;.

The quote must not be escaped when you start with a single quote. The rest doesn’t. This is valid and tested: <img alt='my "<img>"'>

This is StackOverflow after all. Your question is wrong. Your problem is wrong. You are wrong. I am right. Thread locked. Go read this other post that is totally unrelated to your problem I’ve decided isn’t the problem you’re facing because. I. Am. Right.

@errer@lemmy.world
link
fedilink
English
172M

That’s why LLMs are so infuriatingly stubborn, they’re trained on these keyboard warriors

@Quetzalcutlass@lemmy.world
link
fedilink
English
19
edit-2
2M

Could be worse. At least it’s not Microsoft’s support forums:

Hey, I see you’re having problems with <copy-paste key words from OP>. Try the following and see if it fixes your issue.

Open a command prompt and enter ”sfc /scannow".

I hope this helps!

(Reply marked as solution, thread closed.)

I have X years experience with {keyword salad}.

Can you confirm {details already in the opening post}?

Skull giver
link
fedilink
62M

deleted by creator

Magic may be an overstatement. I would be shocked if any of them fixed even 0.1% of the problems posted to Microsoft’s joke of a support forum where they were presented as solutions.

@lud@lemm.ee
link
fedilink
42M

answers.mirosoft.com is the worst. learn.microsoft.com can be decent at times though

JackbyDev
link
fedilink
English
32M

I had a decade old question marked as a duplicate and downvoted three times after years no no activity. SE is such a joke nowadays.

kbal
link
fedilink
02M

It can be done with simple regex of the kind proposed in various answers there iff the html is known to be limited to the subset of html where that sort of thing can easily be made to work. The question does not tell us whether or not that is the case, so everyone is free to make their own assumptions and argue as if they know what’s going on.

Create a post

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

  • Posts must be relevant to programming, programmers, or computer science.
  • No NSFW content.
  • Jokes must be in good taste. No hate speech, bigotry, etc.
  • 1 user online
  • 144 users / day
  • 303 users / week
  • 694 users / month
  • 2.83K users / 6 months
  • 1 subscriber
  • 1.56K Posts
  • 34.7K Comments
  • Modlog