maps.wunderground.com
robots.txt

Robots Exclusion Standard data for maps.wunderground.com

Resource Scan

Scan Details

Site Domain maps.wunderground.com
Base Domain wunderground.com
Scan Status Ok
Last Scan2024-05-04T14:21:03+00:00
Next Scan 2024-06-03T14:21:03+00:00

Last Scan

Scanned2024-05-04T14:21:03+00:00
URL http://maps.wunderground.com/robots.txt
Redirect https://www.wunderground.com/robots.txt
Redirect Domain www.wunderground.com
Redirect Base wunderground.com
Domain IPs 18.158.212.228
Redirect IPs 173.222.146.176, 2600:1413:1:985::2e03
Response IP 23.77.12.42
Found Yes
Hash fda67f5a5f0c9562edc72d31585aa238fe2c69f048c921e599dd487ff8f77b3a
SimHash 2d40dd156dd1

Groups

*

Rule Path
Disallow /bundle-next/
Disallow /CHANGELOG.txt

Other Records

Field Value
sitemap https://www.wunderground.com/sitemaps/sitemap.xml

Comments

  • /robots.txt
  • Last updated by VShrivastava 02/18/2020
  • Disallowed for PhantomJS
  • Crawl-delay: 10
  • App paths
  • Directories
  • Files
  • Paths (clean URLs)
  • Disallow: /migration/
  • Paths (no clean URLs)