Improving hostname validation #2455

@AparnaKarve this is based on discussion in this PR, only extend the validation to front-end. Can you mention some case that this validation is not applicable?

AparnaKarve · 2017-11-01T18:44:05Z

@douglasgabriel I don't know off-hand.

What I do know is that my Lenovo XClarity EMS was created using the https prefix.

Although I just verified that it does work even without https specified.

@h-kataria Would you have an idea if we allow using the protocol (http/https) in the hostname in other areas of the app?

douglasgabriel · 2017-11-02T12:44:43Z

In this validation occurs what I believe that is an inconsistency: the user provides some data (with the protocol, for example) and the system validates only a part of this input (without protocol), after this the system store all data provided by user (with protocol). So, if the user be allowed to provide the protocol, so I believe that the validation strategy must be revised, don't you think?

saulotoledo · 2017-12-08T23:49:13Z

I believe that the validation in this PR should be the same considered here in the ruby code. Also I think that the behavior considered for the hostname should be the same considered for the IP.

Imagine that the hostname mysample.hostname.com points to the (fake) IP 100.100.100.100. Making an HTTP(S) call to mysample.hostname.com or 100.100.100.100 should return the same results unless there is an explicit requirement not to do so.

See examples of the current PR:

mysample.hostname.com <- VALID
mysample.hostname.com/path <- VALID
100.100.100.100 <- VALID
100.100.100.100/path <- INVALID

According to the previous explanation, 1 and 4 are equivalent. Thus, 4 should not be invalid (or both 2 and 4 should be invalid).

The first question then: what option should we choose?
For any answer, both PRs should be updated.

In order to contribute with the discussion, I think we should accept the path in the URLs. Let me prove my point with an example: imagine that, for some reason (security, for example) I just have one public IP, but I have several hosts I want to put in my MiQ instance. I can install something in that public IP (Zuul, for example, or any other technology I want) that transparently redirect subdomains to my internal hosts. Something like:

my.public.host.com/myhostname1 -> redirect to my.local.hostname1.com
my.public.host.com/myhostname2 -> redirect to my.local.hostname2.com
my.public.host.com/myhostname3 -> redirect to my.local.hostname3.com

Considering that my.local.* is not visible for the MiQ, this should be a possible situation.

@agrare, @rodneyhbrown7 could you confirm the information and decide about what should be done in these PRs?

saulotoledo · 2017-12-09T00:13:34Z

app/assets/javascripts/directives/hostname_validation.js

+        return /\b:\/\/\b/.test(hostname);
+      };
+      var isIpAddressWithPath = function(hostname) {
+        var isIpAddress = /\b\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}\b/.test(hostname);


I think the following cases should be invalid: 100.100.100.100invalid, invalid100.100.100.100invalid. Could you confirm that?

In addition, this does not support IPV6 format, pointed out in the UI. The following expression can handle ipv4 and ipv6:

return /((^\s*((([0-9]|[1-9][0-9]|1[0-9]{2}|2[0-4][0-9]|25[0-5])\.){3}([0-9]|[1-9][0-9]|1[0-9]{2}|2[0-4][0-9]|25[0-5]))\s*$)|(^\s*((([0-9A-Fa-f]{1,4}:){7}([0-9A-Fa-f]{1,4}|:))|(([0-9A-Fa-f]{1,4}:){6}(:[0-9A-Fa-f]{1,4}|((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3})|:))|(([0-9A-Fa-f]{1,4}:){5}(((:[0-9A-Fa-f]{1,4}){1,2})|:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3})|:))|(([0-9A-Fa-f]{1,4}:){4}(((:[0-9A-Fa-f]{1,4}){1,3})|((:[0-9A-Fa-f]{1,4})?:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){3}(((:[0-9A-Fa-f]{1,4}){1,4})|((:[0-9A-Fa-f]{1,4}){0,2}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){2}(((:[0-9A-Fa-f]{1,4}){1,5})|((:[0-9A-Fa-f]{1,4}){0,3}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){1}(((:[0-9A-Fa-f]{1,4}){1,6})|((:[0-9A-Fa-f]{1,4}){0,4}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(:(((:[0-9A-Fa-f]{1,4}){1,7})|((:[0-9A-Fa-f]{1,4}){0,5}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:)))(%.+)?\s*$))/.test(hostname);

agrare · 2017-12-09T01:52:27Z

I think we should accept the path in the URLs.

This won't work across the board because most on-premise providers add a path after the hostname/ipaddr for the API endpoint so you wouldn't be able to tell what component of the path is meant to point to another host and what part is for the API endpoint.

saulotoledo · 2017-12-09T13:38:23Z

This won't work across the board because most on-premise providers add a path after the hostname/ipaddr for the API endpoint so you wouldn't be able to tell what component of the path is meant to point to another host and what part is for the API endpoint.

Ok. In addition, strictly speaking, host and path are different concepts, and the port has a specific field for it. So, after thinking a little about it, I agree with you. So, can we continue these PRs by removing the path from the host field?

@agrare @rodneyhbrown7 @douglasgabriel

saulotoledo · 2017-12-09T14:36:12Z

app/assets/javascripts/directives/hostname_validation.js

+      };
+    }
+  }
+})();


If we consider removing the path (see current discussion), here is my suggestion for this code (needs testing):

(function() { angular.module('ManageIQ').directive('hostnameValidation', directive); function directive() { return { require: 'ngModel', restrict: 'A', link: link, }; function link(_scope, _elem, _attrs, ctrl) { ctrl.$validators.hostnameValidation = function(_modelValue, viewValue) { return isHostname(viewValue) || isIpAddress(viewValue); }; /** * Verifies if the informed value is a valid hostname. * * @param {string} value - Value to be validated * @returns {boolean} True if the value is a valid hostname */ var isHostname = function(value) { return /^(([a-zA-Z0-9]|[a-zA-Z0-9][a-zA-Z0-9\-]*[a-zA-Z0-9])\.)*([A-Za-z]|[A-Za-z][A-Za-z0-9\-]*[A-Za-z0-9])$/.test(value); }; /** * Verifies if the informed value is a valid IP (IPV4 or IPV6). * * @param {string} value - Value to be validated * @returns {boolean} True if the value is a valid IP address (IPV4 or IPV6) */ var isIpAddress = function(value) { return /((^\s*((([0-9]|[1-9][0-9]|1[0-9]{2}|2[0-4][0-9]|25[0-5])\.){3}([0-9]|[1-9][0-9]|1[0-9]{2}|2[0-4][0-9]|25[0-5]))\s*$)|(^\s*((([0-9A-Fa-f]{1,4}:){7}([0-9A-Fa-f]{1,4}|:))|(([0-9A-Fa-f]{1,4}:){6}(:[0-9A-Fa-f]{1,4}|((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3})|:))|(([0-9A-Fa-f]{1,4}:){5}(((:[0-9A-Fa-f]{1,4}){1,2})|:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3})|:))|(([0-9A-Fa-f]{1,4}:){4}(((:[0-9A-Fa-f]{1,4}){1,3})|((:[0-9A-Fa-f]{1,4})?:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){3}(((:[0-9A-Fa-f]{1,4}){1,4})|((:[0-9A-Fa-f]{1,4}){0,2}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){2}(((:[0-9A-Fa-f]{1,4}){1,5})|((:[0-9A-Fa-f]{1,4}){0,3}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){1}(((:[0-9A-Fa-f]{1,4}){1,6})|((:[0-9A-Fa-f]{1,4}){0,4}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(:(((:[0-9A-Fa-f]{1,4}){1,7})|((:[0-9A-Fa-f]{1,4}){0,5}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(\.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:)))(%.+)?\s*$))/.test(value); }; } } })();

agrare · 2017-12-13T15:08:55Z

My lack of regex skill precludes me from reviewing the code changes :)
From a functional standpoint I think we should enforce that only and ip address or dns name was added to the hostname field. No protocol, no paths, no ports.

@douglasgabriel IDK the ui tests very well but I highly recommend adding some tests for this validation.

douglasgabriel · 2017-12-14T18:46:31Z

Well, it's done guys. I've add the Saulo's solution on the validation and add some tests to it. Now it avoid the user to put protocol, path and port on the hostname field. What do you think @agrare @saulotoledo @AparnaKarve ?

agrare · 2017-12-14T19:52:39Z

Nice tests @douglasgabriel !

saulotoledo · 2017-12-14T21:25:21Z

LGTM @douglasgabriel

AparnaKarve · 2017-12-14T22:09:09Z

From a functional standpoint I think we should enforce that only and ip address or dns name was added to the hostname field. No protocol, no paths, no ports.

@douglasgabriel Based on the benchmark that @agrare mentioned above, your hostname validation directive works.

I did find something in my testing, though.
I tried something that matches this format 100.100.100.100 but interspersed with letters 100xx.100yy.100zz.100aa, and the directive did not flag it as invalid.
From a practical, real-life aspect, users will not enter invalid hostnames like the above example, especially if they want the validation to work -- so I think we can consider this test to be a nit-pick.
But I guess my point is that the regex does not seem to be 100% foolproof.

The regex does satisfy the "No protocol, no paths, no ports" condition, hence I would say -
LGTM.

miq-bot · 2017-12-15T13:23:29Z

Checked commit douglasgabriel@af4cd88 with ruby 2.3.3, rubocop 0.47.1, haml-lint 0.20.0, and yamllint 1.10.0
0 files checked, 0 offenses detected
Everything looks fine. ⭐

Improving hostname validation (cherry picked from commit a26d124)

simaishi · 2017-12-19T14:35:45Z

Gaprindashvili backport details:

$ git log -1
commit 6e39914c763887da8876418751a363ee4fe440cb
Author: Dan Clarizio <[email protected]>
Date:   Mon Dec 18 12:36:57 2017 -0800

    Merge pull request #2455 from douglasgabriel/fix-XCS-404
    
    Improving hostname validation
    (cherry picked from commit a26d12424fe0ab9d5b91155ad6083a8b278b296c)

miq-bot assigned dclarizio Oct 18, 2017

dclarizio requested a review from AparnaKarve October 18, 2017 22:32

dclarizio added compute/physical infrastructure enhancement fine/no labels Oct 18, 2017

douglasgabriel force-pushed the fix-XCS-404 branch 3 times, most recently from b57635b to b92a53f Compare October 19, 2017 14:09

saulotoledo mentioned this pull request Dec 8, 2017

Improving physical infrastructure provider hostname validation ManageIQ/manageiq#16220

Closed

saulotoledo suggested changes Dec 9, 2017

View reviewed changes

saulotoledo reviewed Dec 9, 2017

View reviewed changes

douglasgabriel force-pushed the fix-XCS-404 branch from b92a53f to 6205167 Compare December 14, 2017 18:02

improving hostname validation

af4cd88

douglasgabriel force-pushed the fix-XCS-404 branch from 6205167 to af4cd88 Compare December 15, 2017 13:22

dclarizio added the gaprindashvili/yes label Dec 18, 2017

dclarizio merged commit a26d124 into ManageIQ:master Dec 18, 2017

dclarizio added this to the Sprint 76 Ending Jan 1, 2018 milestone Dec 18, 2017

simaishi pushed a commit that referenced this pull request Dec 19, 2017

Merge pull request #2455 from douglasgabriel/fix-XCS-404

6e39914

Improving hostname validation (cherry picked from commit a26d124)

simaishi added gaprindashvili/backported and removed gaprindashvili/yes labels Dec 19, 2017

douglasgabriel deleted the fix-XCS-404 branch December 19, 2017 16:40

douglasgabriel mentioned this pull request Dec 21, 2017

Adding hostname format validation ManageIQ/manageiq#16714

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improving hostname validation #2455

Improving hostname validation #2455

douglasgabriel commented Oct 18, 2017 •

edited

Loading

douglasgabriel commented Oct 18, 2017

agrare commented Oct 18, 2017

dclarizio commented Oct 18, 2017

douglasgabriel commented Oct 20, 2017

agrare commented Oct 24, 2017

rodneyhbrown7 commented Nov 1, 2017

AparnaKarve commented Nov 1, 2017 •

edited

Loading

douglasgabriel commented Nov 1, 2017

AparnaKarve commented Nov 1, 2017

douglasgabriel commented Nov 2, 2017

saulotoledo commented Dec 8, 2017 •

edited

Loading

saulotoledo Dec 9, 2017

saulotoledo Dec 9, 2017 •

edited

Loading

agrare commented Dec 9, 2017

saulotoledo commented Dec 9, 2017 •

edited

Loading

saulotoledo Dec 9, 2017

agrare commented Dec 13, 2017

douglasgabriel commented Dec 14, 2017

agrare commented Dec 14, 2017

saulotoledo commented Dec 14, 2017

AparnaKarve commented Dec 14, 2017 •

edited

Loading

miq-bot commented Dec 15, 2017

simaishi commented Dec 19, 2017

Improving hostname validation #2455

Improving hostname validation #2455

Conversation

douglasgabriel commented Oct 18, 2017 • edited Loading

douglasgabriel commented Oct 18, 2017

agrare commented Oct 18, 2017

dclarizio commented Oct 18, 2017

douglasgabriel commented Oct 20, 2017

agrare commented Oct 24, 2017

rodneyhbrown7 commented Nov 1, 2017

AparnaKarve commented Nov 1, 2017 • edited Loading

douglasgabriel commented Nov 1, 2017

AparnaKarve commented Nov 1, 2017

douglasgabriel commented Nov 2, 2017

saulotoledo commented Dec 8, 2017 • edited Loading

saulotoledo Dec 9, 2017

Choose a reason for hiding this comment

saulotoledo Dec 9, 2017 • edited Loading

Choose a reason for hiding this comment

agrare commented Dec 9, 2017

saulotoledo commented Dec 9, 2017 • edited Loading

saulotoledo Dec 9, 2017

Choose a reason for hiding this comment

agrare commented Dec 13, 2017

douglasgabriel commented Dec 14, 2017

agrare commented Dec 14, 2017

saulotoledo commented Dec 14, 2017

AparnaKarve commented Dec 14, 2017 • edited Loading

miq-bot commented Dec 15, 2017

simaishi commented Dec 19, 2017

douglasgabriel commented Oct 18, 2017 •

edited

Loading

AparnaKarve commented Nov 1, 2017 •

edited

Loading

saulotoledo commented Dec 8, 2017 •

edited

Loading

saulotoledo Dec 9, 2017 •

edited

Loading

saulotoledo commented Dec 9, 2017 •

edited

Loading

AparnaKarve commented Dec 14, 2017 •

edited

Loading