spacedaily.com
robots.txt

Robots Exclusion Standard data for spacedaily.com

Resource Scan

Scan Details

Site Domain spacedaily.com
Base Domain spacedaily.com
Scan Status Ok
Last Scan2024-05-25T22:53:35+00:00
Next Scan 2024-06-01T22:53:35+00:00

Last Scan

Scanned2024-05-25T22:53:35+00:00
URL https://spacedaily.com/robots.txt
Redirect https://www.spacedaily.com/robots.txt
Redirect Domain www.spacedaily.com
Redirect Base spacedaily.com
Domain IPs 104.21.58.2, 172.67.196.75, 2606:4700:3031::6815:3a02, 2606:4700:3035::ac43:c44b
Redirect IPs 104.21.58.2, 172.67.196.75, 2606:4700:3031::6815:3a02, 2606:4700:3035::ac43:c44b
Response IP 172.67.196.75
Found Yes
Hash e0ab0deb00cc7f215b79717aef16ddf73babc1f3d9570226bcbce6a87e59992a
SimHash 2d51d2b255b5

Groups

duckduckbot

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

applenewsbot

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

adsbot-google

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

googlebot

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

twitterbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 100

mediapartners-google

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

googlebot-image

Rule Path
Disallow /

googlebot-mobile

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

googlebot-news

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

googlebot-video

Rule Path
Disallow /

bingbot

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

bing

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

msnbot

Rule Path
Disallow /

facebot

Rule Path
Disallow /

yahoo-slurp

Rule Path
Disallow /

slurp

Rule Path
Disallow /

ia_archiver

Rule Path
Allow *

Other Records

Field Value
crawl-delay 100

yandex

Rule Path
Disallow /

Comments

  • User-agent: *
  • Disallow: /