iflscience.com
robots.txt

Robots Exclusion Standard data for iflscience.com

Resource Scan

Scan Details

Site Domain iflscience.com
Base Domain iflscience.com
Scan Status Ok
Last Scan 2024-11-09T00:31:09+00:00
Next Scan 2024-11-16T00:31:09+00:00

Last Scan

Scanned 2024-11-09T00:31:09+00:00
URL https://iflscience.com/robots.txt
Redirect https://www.iflscience.com/robots.txt
Redirect Domain www.iflscience.com
Redirect Base iflscience.com
Domain IPs 52.222.144.110, 52.222.144.23, 52.222.144.43, 52.222.144.89
Redirect IPs 216.137.52.113, 216.137.52.126, 216.137.52.2, 216.137.52.59
Response IP 108.156.22.95
Found Yes
Hash 7854cb5030a17f621b1d1d9b1fce6b45fe61d7ad78bbc15c4caceb8a9a0434a6
SimHash 2308184582f6

Groups

*

Rule Path
Disallow /articles/vendor/
Disallow /search$
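
The two rules above differ in how they match: `/articles/vendor/` reads as a plain path prefix, while the trailing `$` in `/search$` is commonly treated as an end-of-path anchor (the convention described in RFC 9309). A minimal sketch of that matching logic follows, assuming the site intends the RFC 9309 interpretation. The function name `rule_matches` and the sample paths are hypothetical and not part of the scan data.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path.

    Assumed semantics (RFC 9309 style): '*' matches any run of
    characters, and a trailing '$' anchors the match to the end of the path.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal segments and join them with '.*' for each '*' wildcard.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    if anchored:
        # The whole path must match the pattern.
        return re.fullmatch(regex, path) is not None
    # Otherwise the pattern only has to match a prefix of the path.
    return re.match(regex, path) is not None


# Hypothetical sample paths, chosen only to illustrate the two rules.
checks = {
    "/search": rule_matches("/search$", "/search"),
    "/search?q=mars": rule_matches("/search$", "/search?q=mars"),
    "/articles/vendor/lib.js": rule_matches("/articles/vendor/", "/articles/vendor/lib.js"),
}
```

Under this reading, only the exact path `/search` is blocked for the `*` group, while `/search?q=mars` remains crawlable. Crawlers that do not implement the `$` anchor may interpret the rule differently.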

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.iflscience.com/sitemaps/sitemap.xml
sitemap https://www.iflscience.com/sitemaps/sitemap-news-1.xml
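
To see how the groups and records above behave in practice, the sketch below rebuilds a robots.txt from this report and queries it with Python's standard `urllib.robotparser`. The file text is a reconstruction and an assumption: the report lists parsed groups, not the original wording, casing, or ordering. The agent name `examplebot` and the sample paths are hypothetical. As far as I know, the standard-library parser does plain prefix matching and does not interpret the `$` anchor, so `/search$` is left out of the checks.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the groups in this report; the real file's
# exact text is not shown in the scan output.
ROBOTS_TXT = """\
User-agent: *
Disallow: /articles/vendor/
Disallow: /search$

User-agent: ahrefsbot
Crawl-delay: 10

User-agent: bingbot
Crawl-delay: 10

User-agent: ccbot
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: omgilibot
Disallow: /

User-agent: omgili
Disallow: /

User-agent: facebookbot
Disallow: /

User-agent: bytespider
Disallow: /

Sitemap: https://www.iflscience.com/sitemaps/sitemap.xml
Sitemap: https://www.iflscience.com/sitemaps/sitemap-news-1.xml
"""

BASE = "https://www.iflscience.com"

rp = RobotFileParser()
rp.parse(ROBOTS_TXT.splitlines())

# Agents with a blanket "Disallow: /" are refused everywhere.
blocked = rp.can_fetch("ccbot", BASE + "/")

# ahrefsbot has its own group with no rules, so the "*" rules do not apply.
ahrefs_vendor = rp.can_fetch("ahrefsbot", BASE + "/articles/vendor/app.js")

# Agents without their own group (hypothetical "examplebot") fall back to "*".
other_vendor = rp.can_fetch("examplebot", BASE + "/articles/vendor/app.js")
other_article = rp.can_fetch("examplebot", BASE + "/space-and-physics/")

bing_delay = rp.crawl_delay("bingbot")
sitemaps = rp.site_maps()  # available in Python 3.8+

print(blocked, ahrefs_vendor, other_vendor, other_article, bing_delay, sitemaps)
```

Note that a crawler matching a named group uses only that group's rules, which is why the report shows "All paths allowed" for ahrefsbot and bingbot even though the `*` group disallows two paths.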

Comments

  • robots.txt file for https://www.iflscience.com/
  • bot crawl-delays
  • Disallow
  • sitemap list for https://www.iflscience.com/