Show me a better text format for serializing

onlinepersona@programming.dev · 1 year ago

Show me a better text format for serializing

suy@programming.dev · 1 year ago

Norway.

zorronsfw@lemmynsfw.com · edit-2 1 year ago

deleted by creator

magic_lobster_party@kbin.social · 1 year ago

JSON for serialization all the way. It’s simple and to the point. It does one thing and does it well. There’s little room for annoying surprises. Any JSON can easily be minified and prettified back and forth. If you want it in binary format you can convert it to BSON.

Yaml is too much of a feature creep. It tries to do way too many things at the same time. There are so many traps to fall into if you’re not cautious enough. The same thing can be written in multitudes of ways.

jjjalljs@ttrpg.network · edit-2 1 year ago

Yes, but whoever decided that json can’t have trailing commas has my ire.

{ "a": 1,
  "b": 2,  &lt;-- nope
}

There was some other pitfall I can’t remember around missing keys and undefined, too, but I can’t remember it now.

OmnipotentEntity@beehaw.org · 1 year ago

Change to Haskell formatted commas and the problem goes away :D

{ "a": 1
, "b": 2
, "c":
    [ 3
    , 6
    , 9
    ]
}

ursakhiin@beehaw.org · 1 year ago

Where is the nearest fire to dump this comment in?

adambowles@feddit.uk · 1 year ago

Trailing commas are supported in json5, as well as comments

Sgtmoustache@lemmy.world · 1 year ago

I’m pretty sure you can have trailing commas…

Bjornir@programming.dev · 1 year ago

You can’t but some parser allow them. But those that do do not respect the standard.

Sgtmoustache@lemmy.world · 1 year ago

Good to know. Must be why I thought you could. Thanks.

adambowles@feddit.uk · 1 year ago

deleted by creator

kool_newt@lemm.ee · edit-2 9 months ago

deleted by creator

Kogasa@programming.dev · 1 year ago

Toml

interolivary@beehaw.org · edit-2 1 year ago

There’s a special place in hell for the inventor of semantically significant whitespace.

YAML itself is one of the circles of hell. You have to copy-paste YAML from web etc sources with dubious formatting for all eternity, and the editor doesn’t have YAML support. Also you can only use Python

magic_lobster_party@kbin.social · edit-2 1 year ago

Indenting copy pasted yaml is always a pain in the butt. Any indentation you give is likely a valid yaml. Especially bad considering indentation has a significant meaning. You have to double check back and forth to ensure nothing bad has sneaked in.

With JSON there are no such discrepancies. It’s likely the editor has figured it out for you already. If it hasn’t it’s easy to prettify the JSON yourself.

interolivary@beehaw.org · 1 year ago

Semantic whitespace problems can easily be literally impossible to solve automatically. One of the dumbest fucking ideas anybody ever came up with in computing and its inventor if anyone belongs in YAML Hell. As a fuckup it’s not quite as bad as null, but that ain’t exactly a high bar

synae[he/him]@lemmy.sdf.org · 1 year ago

Sounds like a good time to me!

interolivary@beehaw.org · 1 year ago

I’m not sure which thought is scarier: that you don’t know what you’re signing up for, or that you do know and you enjoy fixing undecidable formatting fuckups manually

synae[he/him]@lemmy.sdf.org · 1 year ago

There’s a bonus third option: I started writing python professionally in 2007 and nowadays spend 75% of my “hands on keyboard” time working on kubernetes YAML and I am indeed having a good time.

I admit, I hastily misread the tail end of your comment as (e.g.) “A reason YAML is bad because you have to copy-paste from the web and that sucks”; not as you probably meant it “in this special hell, you must deal with copy-pasted nbsp and other trash”. So maybe I did not know exactly what I was signing up for ;)

I don’t deal with anything like that and not entirely sure how it happens to people enough that it is a common complaint. “undecidable formatting fuckups” are a non-issue in my life, I really don’t understand how people encounter such things. Maybe they need to fix their editor/IDE/tools? Skill issue? IDK.

As a tangent- I don’t care what language code is written in, it had better be indented properly (and linted, and follow the project’s codestyle, …). Our juniors learn pretty early that their change requests will be blocked on formatting alone by CI, and a human won’t even bother reviewing the substance of their change if they don’t follow convention. I don’t hear them ever complaining about any of these things, least of all semantic whitespace … and we have a rich culture of bitching about menial/pedantic things ;)

NewPerspective@lemmy.world · 1 year ago

My problem with yaml is if you truncate it at any random spot, there’s a high likelihood it’s still valid yaml. I don’t like the idea that things can continue without even knowing there’s a problem. The single opening and closing curly braces enclosing a json object is all it takes to at least know you didn’t receive the entire message. Toml has the same issue. I’ll stick with json when it makes sense.

pimeys@lemmy.nauk.io · 1 year ago

Add a schema to it and you get XML. The ultimate serialization format.

marcos@lemmy.world · 1 year ago

Quite like YAML, XML has too many stuff in it. While a lot of parsers are not standard compliant and safe, if there’s any chance the stuff you include on your code can evolve into a fully featured parser, including it is something to avoid.

There is this language called KDL that looks interesting.

calcopiritus@lemmy.world · 1 year ago

Serializing? For serializing you probably want performance above all else. I’m saying this without checking any benchmark, but I’m sure yaml is more expensive to parse than other formats where indentation don’t have meaning.

For human readability: it has to be readable (and writeable) by all humans. I know (a lot of people) that dislike yaml, toml and XML. I don’t know of a single person that struggles to read/write json, there is a clear winner.

DrM@feddit.de · 1 year ago

JSON would be perfect if it allowed for comments. But it doesn’t and that alone is enough for me to prefer YAML over JSON. Yes, JSON is understandable without any learning curve, but having a learning curve is not always bad. YAML provides a major benefit that is worth the learning curve and doesn’t have the issues that XML has (which is that there is no way to understand an XML without also having the XSD for it)

Michal@programming.dev · 1 year ago

Json should also allow for trailing commas. There’s no reason for it not too. It’s annoying having to maintain commas.

DerArzt@lemmy.world · 1 year ago

And also a standard date time type!

dfyx@lemmy.helios42.de · 1 year ago

What is wrong with ISO 6801 strings?

intensely_human@lemm.ee · 1 year ago

11-2023-14

I dunno it just kinda looks weird to me

Hexarei@programming.dev · 1 year ago

Dunno what format you’ve got there, but ISO 6801 looks like 2023-11-15T18:28:31Z

her01n@lemmy.world · 1 year ago

It’s a joke, because the standard is 8601, not 6801.

frezik@midwest.social · edit-2 1 year ago

JSON5 has comments, among fixing a few other shortsighted limitations of the original.

Kogasa@programming.dev · 1 year ago

If a comment isn’t part of the semantic content of a JSON object it has no business being there. JSON models data, it’s not markup language for writing config files.

You can use comments in JSON schema (in a standardized way) when they are semantically relevant: https://json-schema.org/understanding-json-schema/reference/comments

For the data interchange format, comments aren’t part of the JSON grammar but the option to parse non-JSON values is left open to the implementation. Many implementations do detect (and ignore) comments indicated by e.g. # or //.

frezik@midwest.social · 1 year ago

JSON models data, it’s not markup language for writing config files.

JavaScript package management promptly said otherwise. JSON is a config format no matter if you like it or not.

Kogasa@programming.dev · 1 year ago

I’ve disagreed with JavaScript before, what makes you think I won’t do it again?

Anyway, anything using JSON as a config language will also certainly use a JSON interpreter that can ignore comments. Sure that’s “implementation specific,” but so is a config file. You wouldn’t use “MyApplication.config.json” outside the context of MyApplication loading its own configuration, so there’s no need for it to be strictly compliant JSON as long as it plays nicely with most text editors.

kool_newt@lemm.ee · edit-2 9 months ago

deleted by creator

Blackmist@feddit.uk · 1 year ago

I don’t know why we’re fucking about trying to use text editors to manipulate structured data.

Yeah, it’s convenient to just be able to use a basic text editor, but we’re not trying to cram it all on a floppy disk here. I’m sure we could have a nice structured data editor somewhere for all those XML, JSON and YAML files we’re supposed to maintain every day.

Kogasa@programming.dev · 1 year ago

Serializing isn’t necessarily about performance, or we’d just use protobuf or similar. I agree Json is a great all rounder. Combine with JSON object schema to define sophisticated DSLs that are still readable, plain JSON. TOML is nice as a configuration language, but its main appeal (readability) suffers when applied to complex modeling tasks. XML is quite verbose and maybe takes the “custom DSL” idea a little too far. YAML is a mistake.

synae[he/him]@lemmy.sdf.org · 1 year ago

For serializing? I’d probably just go with json.

For content meant to be written or edited by humans? YAML all day baby

Andy@programming.dev · 1 year ago

Ever tried NestedText? It’s like basic YAML but everything is a string (types are up to the code that ingests it), and you never ever need to escape a character.

synae[he/him]@lemmy.sdf.org · 1 year ago

I’ve got too many consumers that I don’t control which dictate their input formats. And to be quite honest, “types are up to the code that ingests it” sounds like a huge negative to me.

Andy@programming.dev · 1 year ago

Ah, well I love that policy (types being in code, not configs). FWIW I sometimes use it as a hand-edited document, with a small type-specifying file, to generate json/yaml/toml for other programs to load.

MagnoliaMayhem@programming.dev · 1 year ago

Json. Your move, Joker.

Terrasque@infosec.pub · 1 year ago

puts the json in the yaml parser

Your move, foolish mortal

derpgon@programming.dev · 1 year ago

For those uninitiated, every JSON is a valid YAML, since YAML is just a superset of JSON.

colourlessidea@feddit.de · 1 year ago

YAML is pretty good for readability, pretty awful for writability

synae[he/him]@lemmy.sdf.org · 1 year ago

Interesting, I find that the other reasonable options are far less writable than yaml

NewPerspective@lemmy.world · 1 year ago

Rule of thumb: valid json is valid yaml. If you’re ever unsure, do it the old fashioned way.

intensely_human@lemm.ee · 1 year ago

I really don’t see how that’s true.

You’re telling me this is valid yaml?

{ firstName: “Intensely” }

How is that yaml?

NewPerspective@lemmy.world · 1 year ago

Don’t listen to me, put that in a yaml validator for yourself: https://yamllint.com

Fal@yiffit.net · 1 year ago

No, because that’s not valid json

nxdefiant@startrek.website · 1 year ago

If you have a choice to start from scratch, TOML is probably the better option.

urquell@lemm.ee · 1 year ago

So much json here. All wrong, it’s csv

Artyom@lemm.ee · edit-2 1 year ago

Yaml is a great, human-readible file format. Unless there’s an exclamation point in it, then it is an illegible Eldrich horror.

kool_newt@lemm.ee · edit-2 9 months ago

deleted by creator

onlinepersona@programming.dev · 1 year ago

text?

kool_newt@lemm.ee · edit-2 9 months ago

deleted by creator

TechNom (nobody)@programming.dev · 1 year ago

Protobuf is also not a proper binary alternative for Yaml. Protobuf needs a schema in the form of its IDL. Cbor and messagepack might be more analogous.

Windex007@lemmy.world · 1 year ago

Genuinely curious what features OP is looking for, specifically for serialization as per the post, that has resulted in the conclusion being yaml.

JoYo@lemmy.ml · 1 year ago

I didn’t even know there was a difference.