Allow umlaut domains for website addresses #952

schneidr · 2023-04-18T06:57:32Z

Checklist

All new and existing tests are passing
(If adding features:) I have added tests to cover my changes
I have added an entry to CHANGES.rst because this is a user-facing change or an important bugfix
I have written proper commit message(s)

What changes does this Pull Request introduce?

Changed website validation to allow domain names containing umlauts

Why is this necessary?

Resolves issue #951

ix5 · 2023-04-19T18:48:53Z

Thank you for this PR. In my eyes, we should avoid introducing new dependencies.

The relevant part of the validators package is domain.py.
The package is MIT licensed and we can copy over that function into isso rather than having to load a whole package for a simple regex check.
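
As a rough illustration of what a dependency-free check could look like, here is a minimal sketch. It is not the code from validators' domain.py and not what was merged in this PR: the function name `is_valid_hostname` and the label pattern are hypothetical, and it relies on Python's built-in `idna` codec (IDNA 2003) to turn umlaut domains into their ASCII form before matching.

```python
import re

# Hypothetical label pattern (an assumption, not copied from validators):
# 1-63 chars of letters, digits or hyphens, not starting or ending with "-".
_LABEL_RE = re.compile(r"^(?!-)[a-z0-9-]{1,63}(?<!-)$")


def is_valid_hostname(hostname):
    """Return True if hostname looks like a plausible (possibly IDN) domain."""
    try:
        # The built-in "idna" codec converts e.g. umlauts to punycode labels.
        ascii_name = hostname.encode("idna").decode("ascii")
    except UnicodeError:
        return False
    if not ascii_name or len(ascii_name) > 253:
        return False
    labels = ascii_name.lower().split(".")
    # Assumption: require at least two labels, i.e. "name.tld".
    return len(labels) >= 2 and all(_LABEL_RE.match(label) for label in labels)
```

The port would still have to be stripped before calling such a helper, since it only looks at the host part.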

jelmer

Please also add a test for the use case you're trying to address

jelmer · 2023-04-19T19:02:43Z

isso/views/comments.py

-    return __url_re.match(text) is not None
+    text = normalize(text)
+    # urlparse does not like port numbers in URLs
+    text = re.sub(r':\d+', '', text)


This comment seems odd to me - urlparse handles port numbers in URLs fine, so there must be something else going on?

schneidr · 2023-04-23

My bad: the reason for removing the port was not urlparse but validators.domain(), which does not accept domain:port. I guess I could clean this up by using hostname instead of netloc, but if the complete URL is supposed to be validated, these lines will most likely not stay anyway.
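
For reference, a small sketch of the difference between `netloc` and `hostname` in the standard library's `urllib.parse.urlparse`. The helper name `split_host_and_port` is made up for illustration and is not part of this PR.

```python
from urllib.parse import urlparse


def split_host_and_port(url):
    """Return (netloc, hostname, port) as reported by urlparse."""
    parts = urlparse(url)
    # netloc keeps the ":port" suffix, hostname drops it (and is lowercased),
    # port is an int or None when no port is given.
    return parts.netloc, parts.hostname, parts.port
```

Using `hostname` would make the `re.sub(r':\d+', '', text)` step unnecessary, because the port never reaches the domain check.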

I did add my test case in test_comments.py.

jelmer · 2023-04-19T19:04:19Z

Do we actually need to check the domain name at all?

ix5 · 2023-04-21T14:15:06Z

> Do we actually need to check the domain name at all?

Because IIRC the website is inserted as a link (if given), we should make sure it is valid.

If we skipped the validity check, I'm not sure that the markup escaping would catch e.g. someone entering malicious Javascript foo</a>.

jelmer · 2023-04-21T18:35:33Z

> > Do we actually need to check the domain name at all?

> Because IIRC the website is inserted as a link (if given), we should make sure it is valid.

> If we skipped the validity check, I'm not sure that the markup escaping would catch e.g. someone entering malicious Javascript foo</a>.

That seems like an argument for just fixing the markup escaping to me...
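
To make the escaping idea concrete, here is a minimal sketch using the standard library's `html.escape`. The helper `render_website_link` is hypothetical and does not reflect how isso actually renders the website field; it only shows that escaping both the attribute value and the link text keeps injected markup inert.

```python
import html


def render_website_link(url, label):
    """Build an anchor tag with both the href and the text HTML-escaped."""
    # quote=True also escapes double and single quotes, which matters
    # inside an attribute value.
    safe_url = html.escape(url, quote=True)
    safe_label = html.escape(label, quote=True)
    return '<a href="{}" rel="nofollow noopener">{}</a>'.format(safe_url, safe_label)
```

Escaping alone does not reject `javascript:` URLs, so a scheme check would still be needed on top of it.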

schneidr · 2023-04-23T06:29:51Z

> > Do we actually need to check the domain name at all?

> Because IIRC the website is inserted as a link (if given), we should make sure it is valid.

> If we skipped the validity check, I'm not sure that the markup escaping would catch e.g. someone entering malicious Javascript foo</a>.

So, the goal is to actually check the complete entered URL, not only if the domain is valid, as I assumed?

jelmer · 2023-08-04T13:01:47Z

Sorry for the delay; let's just merge this, since it's clearly an improvement over the current situation. Ideally, we'd be moving away from checking for valid domains to checking for malicious things though.
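
One possible shape for "checking for malicious things" instead of validating domains is a scheme allowlist. The sketch below is only an assumption about what such a check might look like; `ALLOWED_SCHEMES` and `looks_safe` are hypothetical names and nothing like this is part of the merged change.

```python
from urllib.parse import urlparse

# Assumption: only plain web links are acceptable in the website field.
ALLOWED_SCHEMES = {"http", "https"}


def looks_safe(url):
    """Accept only http(s) URLs that carry a host part."""
    try:
        parts = urlparse(url.strip())
        scheme, hostname = parts.scheme, parts.hostname
    except ValueError:
        # urlparse raises ValueError for some malformed inputs,
        # e.g. an unterminated IPv6 bracket.
        return False
    return scheme in ALLOWED_SCHEMES and bool(hostname)
```

A check like this accepts umlaut domains without any domain-specific rules, while rejecting inputs such as `javascript:alert(1)`.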

schneidr added 2 commits April 18, 2023 08:28

allow domain names containing umlaut characters

ca1978f

added changelog entry

a9e55a4

schneidr changed the title ~~Umlaut domains~~ Allow umlaut domains for website addresses Apr 18, 2023

ix5 added the labels server ((Python) server code), bug, and needs-decision (Architectural/Behavioral decision by maintainers needed) Apr 18, 2023

ix5 added this to the 0.14 milestone Apr 18, 2023

jelmer requested changes Apr 19, 2023

Revisions from comments from pull request isso-comments#952

9001945

schneidr requested a review from jelmer April 23, 2023 07:06

jelmer approved these changes Aug 4, 2023

jelmer merged commit 73d9886 into isso-comments:master Aug 4, 2023

schneidr deleted the umlaut-domains branch August 4, 2023 13:08

ix5 mentioned this pull request Jan 20, 2024

Unable to post comments with umlaut domains in the website field #951

Closed

3 tasks

ix5 modified the milestones: 0.14, 0.13.1 May 9, 2024
