wunderground.com
robots.txt

Robots Exclusion Standard data for wunderground.com

Resource Scan

Scan Details

Site Domain wunderground.com
Base Domain wunderground.com
Scan Status Ok
Last Scan 2024-06-15T06:42:59+00:00
Next Scan 2024-06-22T06:42:59+00:00

Last Scan

Scanned 2024-06-15T06:42:59+00:00
URL https://wunderground.com/robots.txt
Redirect https://www.wunderground.com/robots.txt
Redirect Domain www.wunderground.com
Redirect Base wunderground.com
Domain IPs 23.15.25.62, 2600:1413:b000:58d::2e03, 2600:1413:b000:599::2e03
Redirect IPs 104.69.160.216, 2600:1413:1:985::2e03
Response IP 23.41.77.84
Found Yes
Hash fda67f5a5f0c9562edc72d31585aa238fe2c69f048c921e599dd487ff8f77b3a
SimHash 2d40dd156dd1

Groups

*

Rule Path
Disallow /bundle-next/
Disallow /CHANGELOG.txt
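
The group above can be checked programmatically. The following is a minimal sketch, assuming the two rules are written back out as ordinary robots.txt lines and evaluated with Python's standard `urllib.robotparser`. The helper `is_allowed`, the user agent name `ExampleBot`, and the sample path `/weather/us` are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# The "*" group from the scan, rewritten as robots.txt lines.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /bundle-next/",
    "Disallow: /CHANGELOG.txt",
]

BASE_URL = "https://www.wunderground.com"


def is_allowed(path, user_agent="ExampleBot"):
    """Return True if the "*" group permits fetching the given path.

    `user_agent` is a hypothetical crawler name; any name falls under
    the "*" group because no other group is listed in the scan.
    """
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)
    return parser.can_fetch(user_agent, BASE_URL + path)


if __name__ == "__main__":
    # "/weather/us" is an illustrative path, not taken from the scan.
    for path in ("/weather/us", "/bundle-next/main.js", "/CHANGELOG.txt"):
        print(path, is_allowed(path))
```

The rules are matched as path prefixes, so everything under /bundle-next/ is excluded, while paths not listed in the group remain fetchable.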

Other Records

Field Value
sitemap https://www.wunderground.com/sitemaps/sitemap.xml
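
The sitemap record can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and assuming the record appears in the file as a plain `Sitemap:` line. The helper name `list_sitemaps` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# The sitemap record from the scan, rewritten as a robots.txt line.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /bundle-next/",
    "Disallow: /CHANGELOG.txt",
    "Sitemap: https://www.wunderground.com/sitemaps/sitemap.xml",
]


def list_sitemaps(lines):
    """Return the sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []


if __name__ == "__main__":
    print(list_sitemaps(ROBOTS_LINES))
```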

Comments

  • /robots.txt
  • Last updated by VShrivastava 02/18/2020
  • Disallowed for PhantomJS
  • Crawl-delay: 10
  • App paths
  • Directories
  • Files
  • Paths (clean URLs)
  • Disallow: /migration/
  • Paths (no clean URLs)