wunderground.com
robots.txt

Robots Exclusion Standard data for wunderground.com

Resource Scan

Scan Details

Site Domain wunderground.com
Base Domain wunderground.com
Scan Status Ok
Last Scan2024-11-09T18:51:09+00:00
Next Scan 2024-11-16T18:51:09+00:00

Last Scan

Scanned2024-11-09T18:51:09+00:00
URL https://wunderground.com/robots.txt
Redirect https://www.wunderground.com/robots.txt
Redirect Domain www.wunderground.com
Redirect Base wunderground.com
Domain IPs 173.222.146.176, 2600:1413:1:985::2e03
Redirect IPs 173.222.146.176, 2600:1413:1:985::2e03
Response IP 173.222.146.176
Found Yes
Hash fda67f5a5f0c9562edc72d31585aa238fe2c69f048c921e599dd487ff8f77b3a
SimHash 2d40dd156dd1

Groups

*

Rule Path
Disallow /bundle-next/
Disallow /CHANGELOG.txt

Other Records

Field Value
sitemap https://www.wunderground.com/sitemaps/sitemap.xml

Comments

  • /robots.txt
  • Last updated by VShrivastava 02/18/2020
  • Disallowed for PhantomJS
  • Crawl-delay: 10
  • App paths
  • Directories
  • Files
  • Paths (clean URLs)
  • Disallow: /migration/
  • Paths (no clean URLs)