I'm working on updating jsdom's TR46 implementation for the Unicode 16.0.0 / TR46 rev 33 changes, and I believe I've found some incorrect test cases.
toascii.json (all from WebKit upstreaming in 6fa3fe8)
{
"input": "www.lookout.net\u2A7480",
"output": null
},

My implementation converts this to www.lookout.net::=80.
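The mapping itself can be cross-checked outside any TR46 implementation. This is a minimal sketch, assuming Python's `unicodedata` module and assuming the TR46 mapping for U+2A74 agrees with its NFKC decomposition (the IDNA mapping table remains the authority):

```python
import unicodedata

host = "www.lookout.net\u2A7480"

# U+2A74 DOUBLE COLON EQUAL has a compatibility decomposition to "::=",
# so NFKC normalization expands it into three ASCII characters.
mapped = unicodedata.normalize("NFKC", host)
print(mapped)
```

Whether the resulting ":" and "=" are then rejected depends on UseSTD3ASCIIRules, which is a separate step from the mapping.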
{
"input": "www\u00A0.lookout.net",
"output": null
}

My implementation passes this through as valid. I cannot find anything in https://unicode.org/reports/tr46/#Validity_Criteria which would disallow it.
{
"input": "\u001flookout.net",
"output": null
}

My implementation passes this through as valid. This would be invalid if we were passing UseSTD3ASCIIRules=true, but I don't believe we are. At least, my test harness has never set it, and the URL Standard passes it as false (via setting beStrict to false).
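To make the dependence on UseSTD3ASCIIRules concrete, here is a rough sketch of only the STD3 part of the validity check. It assumes the rule is "ASCII code points must be a-z, 0-9, or hyphen-minus when UseSTD3ASCIIRules is true"; the helper names `violates_std3` and `label_ok` are hypothetical and not taken from jsdom or any test harness:

```python
def violates_std3(label: str) -> bool:
    """Hypothetical helper: True if any ASCII code point is outside a-z, 0-9, '-'."""
    allowed = set("abcdefghijklmnopqrstuvwxyz0123456789-")
    return any(ord(ch) < 0x80 and ch not in allowed for ch in label)


def label_ok(label: str, use_std3_ascii_rules: bool) -> bool:
    # Only the STD3 portion of the validity criteria is modeled here;
    # every other criterion is assumed to pass.
    return not (use_std3_ascii_rules and violates_std3(label))


print(label_ok("\u001flookout", True))   # False: U+001F is ASCII but not a-z, 0-9, '-'
print(label_ok("\u001flookout", False))  # True: the STD3 check is skipped
```

Under this reading, a label containing U+001F or an ASCII space would only be rejected by a harness that sets the flag to true.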
IdnaTestV2.json (from d0cd7c0)
{
"comment": "V4; V2 (ignored)",
"input": "xn--xn--a--gua.pt",
"output": null
},

My implementation gives back the input for this. I haven't yet dug into where the bug might be here. It's possible I didn't adapt my code well enough.
{
"comment": "V4; V2 (ignored)",
"input": "xn--xn---epa",
"output": null
},

My implementation gives back the input for this. I haven't yet dug into where the bug might be here. It's possible I didn't adapt my code well enough.
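One possible lead for these two cases is to look at what the labels decode to. The sketch below assumes Python's built-in `punycode` codec, and assumes that the V4 in the test comments refers to a rule that a decoded label must not itself begin with xn-- when CheckHyphens is false (that reading of the error code is an assumption). The helper name `decode_a_label` is hypothetical:

```python
def decode_a_label(label: str) -> str:
    """Hypothetical helper: strip the ACE prefix and Punycode-decode the rest."""
    assert label.startswith("xn--")
    return label[4:].encode("ascii").decode("punycode")


for label in ("xn--xn--a--gua", "xn--xn---epa"):
    decoded = decode_a_label(label)
    # Report whether the decoded label still begins with the ACE prefix.
    print(label, "->", decoded, decoded.startswith("xn--"))
```

If both decoded labels begin with xn--, a missing check for that prefix after decoding would explain why the input is returned unchanged instead of producing an error.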