AI’s take on XML

Anna

Is this a tactic used by skynet to lure all humans together and then…BANG!!!

annoyed-onion

I mean, it’s not wrong…

@Serinus@lemmy.world

Disagree. I prefer XML for config files where the efficiency of disk size doesn’t matter at all. Layers of XML are much easier to read than layers of Json. Json is generally better where efficiency matters.

@Hexarei@programming.dev

TOML or bust

@renzev@lemmy.world

yes.

@Matriks404@lemmy.world

Aren’t most XML parsers faster than JSON parsers anyway?

@goferking0@lemmy.sdf.org

Wishful thinking

Gregor

a wate of time

@expatriado@lemmy.world

stuff like this is how reddit found out their users comments were being used 😂

@renzev@lemmy.world

Fonzie!

I’m sorry which LLM is this? What are its settings? How’d you get that out of it?

And how did it give sources?

@renzev@lemmy.world

I’m sorry which LLM is this?

It’s perplexity.ai. I like it because it doesn’t require an account and because it can search the internet. It’s like microsoft’s bing but slightly less cringe.

How’d you get that out of it?

The screenshot is fake. I used Inspect Element.

Fonzie!

perplexity.ai

Like DuckDuckGo’s AI’s, but with sources? Sounds cool, thanks!

fake

Ah… Too bad (:

@renzev@lemmy.world

Never knew that ddg had an LLM, will check it out. Thanks!

Fonzie!

It’s a proxy for a number of LLMs of choice, prompts anonymised before they’re sent. A bit like how their search engine is anonymised Bing, or how their maps are anonymised Apple Maps. I’m happy with the service!

@pfjarschel@lemmy.world

The answer is not real. The tool, on the other hand, is called Perplexity. It “understands” your question, searches the web, and gives you a summary, citing all the relevant sources.

Fubarberry

AI is starting to get really smart

@Batman@lemmy.world

A word document is xml

@renzev@lemmy.world

zipped xml!

clb92

Lots or file formats are just zipped XML.

I was ~~reverse engineering~~ fucking around with the LBX file format for our Brother label printer’s software at work, because I wanted to generate labels programmatically, and they’re zipped XML too. Terrible format, LBX, really annoying to work with. The parser in Brother P-Touch Editor is really picky too. A string is 1 character longer or shorter than the length you defined in an attribute earlier in the XML? “I’ve never seen this file format in my life,” says P-Touch Editor.

@SzethFriendOfNimi@lemmy.world

Sounds like it’s actually using XSLT or some kind of content validation. Which to be honest sounds like a good practice.

clb92

Here’s an example of a text object taken from the XML, if you’re curious: https://clips.clb92.xyz/2024-09-08_22-27-04_gfxTWDQt13RMnTIS.png

EDIT: And with more complicated strings (like ones havingnumbers or symbols - just regular-ass ASCII symbols, mind you) there will be tens of <stringItem>, because apparently numbers and letters don’t even work the same. Even line breaks have their own <stringItem>. And if the number of these <stringItem> and their charLen don’t match what’s actually in pt:data, it won’t open the file.

@SzethFriendOfNimi@lemmy.world

Is it because of the lower case Latin æ since it’s technically one character even if two bytes?

clb92

Nope, doesn’t seem like it.

@SzethFriendOfNimi@lemmy.world

What a mess… sounds like the devs got burned by various Unicode edge cases RTL, etc

@Batman@lemmy.world

The future if text documents were Json:

City_pic.png.xml

BlackEco

RSS/ATOM has to be the best thing to come out of XML

Fontasia

OH HEY EVERYONE, EVERYONE, THIS GUY LIKES JSON

Fuck you and your unstructured garbage.

@LittleWizard@feddit.org

I hate writing xml with a passion

@Knock_Knock_Lemmy_In@lemmy.world

If you are writing it then you are doing it wrong.

@MoogleMaestro@lemmy.zip

I hate writing a serialized format

I mean, that’s why it’s serialized. It’s not supposed to be written by hand, that’s why you have a deserializer. 🤦

@smeenz@lemmy.nz

What about writing in xml without any passion ?

𝘋𝘪𝘳𝘬

This is fine.

@toastal@lemmy.ml

Balisage Paper: Fat Markup: Trimming the Fat Markup Myth one calorie at a time

https://www.balisage.net/Proceedings/vol10/html/Lee01/BalisageVol10-Lee01.html

XML is a fine format in comparison to JSON.

@Takumidesh@lemmy.world

I hate writing and reading xml compared to json, I don’t really care if one is slightly leaner than the other. If your concern is the size or speed you should probably be rethinking how you serialize the data anyway (orotobuff/DB)

@toastal@lemmy.ml

I mean we have a generation that thinks XML is bloated & JSON is superior but those two formats are about the same on performance & compressed size–which was the point. The non-plaintext-readable formats are superior along a lot of metrics but harder to debug & ultimately less common.

@flashgnash@lemm.ee

Listen we all know deep down the solution is to try to parse it with regex

@AeonFelis@lemmy.world

XML is good for markup. The problem is that people too often confuse “markup” and “serialization”.

@bss03@infosec.pub

Too redundant, just use S-exprs.

(Mostly joking, but in some cases…)

Dr. Bluefall

Unironically.

Given the choice between S-expressions and XML, I will choose S-expressions.

Tomkoid

BASED. What is the name of this AI? I want to use this.

@lemmesay@discuss.tchncs.de

coral by cohere

no wait, it’s perplexity, I remember the logo.
you can try their labs version which gives to access to latest and beefy models like llama3.1 70b

@umbrella@lemmy.ml

finally accurate ai

@RunningInRVA@lemmy.world

Except for obvious typos

@whoisearth@lemmy.ca

XML is fine. Namespaces have a special place in hell though

AI’s take on XML

AI’s take on XML

Programmer Humor

Rules: