spaopportunities.com
robots.txt

Robots Exclusion Standard data for spaopportunities.com

Resource Scan

Scan Details

Site Domain spaopportunities.com
Base Domain spaopportunities.com
Scan Status Ok
Last Scan 2024-10-22T03:31:29+00:00
Next Scan 2024-11-21T03:31:29+00:00

Last Scan

Scanned 2024-10-22T03:31:29+00:00
URL https://spaopportunities.com/robots.txt
Redirect https://www.spaopportunities.com/robots.txt
Redirect Domain www.spaopportunities.com
Redirect Base spaopportunities.com
Domain IPs 104.21.79.124, 172.67.145.162, 2606:4700:3033::ac43:91a2, 2606:4700:3035::6815:4f7c
Redirect IPs 104.21.79.124, 172.67.145.162, 2606:4700:3033::ac43:91a2, 2606:4700:3035::6815:4f7c
Response IP 104.21.79.124
Found Yes
Hash 2130ae90bf30acd5e49c1a667c051f66171b20c540f380bf99520451133463cf
SimHash 521cc472e0b3

Groups

ahrefsbot

Rule Path
Disallow /

blp_bbot/0.1

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

cyberalert

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

webcrawler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

cityreview

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

yandex

Rule Path
Disallow /

discobot

Rule Path
Disallow /

birubot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
Disallow /

twitterbot

Rule Path
Disallow /

gosospider

Rule Path
Disallow /

steeler

Rule Path
Disallow /

summify

Rule Path
Disallow /

accelobot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /images/dir/

*

No rules defined. All paths allowed.
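
The groups listed above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a hypothetical reconstruction of three of the groups, not the site's actual file, and the empty `Disallow:` line for `*` is an assumed way of writing "no rules, all paths allowed". The `ExampleBot` agent name and the helper `build_parser` are likewise illustrative.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of three groups from the report;
# the original file's exact text is not shown in the scan output.
ROBOTS_TXT = """\
User-agent: ahrefsbot
Disallow: /

User-agent: googlebot-image
Disallow: /images/dir/

User-agent: *
Disallow:
"""


def build_parser(text):
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://www.spaopportunities.com"

# (user agent, path) pairs to test against the reconstructed rules.
checks = [
    ("AhrefsBot", "/"),
    ("Googlebot-Image", "/images/dir/photo.jpg"),
    ("Googlebot-Image", "/index.html"),
    ("ExampleBot", "/"),
]
for agent, path in checks:
    allowed = parser.can_fetch(agent, base + path)
    print(f"{agent:16} {path:24} allowed={allowed}")
```

Under these assumptions, `ahrefsbot` is blocked everywhere, `googlebot-image` is blocked only under `/images/dir/`, and any agent without its own group falls through to `*` and is allowed.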

Other Records

Field Value
crawl-delay 10

Warnings

  • 2 invalid lines.
  • `user agent` is not a known field.
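
A warning such as the unknown `user agent` field typically arises when a field name is written with a space instead of a hyphen (`User-agent`). The sketch below shows one way such lines could be flagged. The `KNOWN_FIELDS` set, the `lint_robots` function, and the `SAMPLE` text are all assumptions made for illustration; the report does not show the actual invalid lines or the scanner's own validation logic.

```python
# Field names accepted by this sketch (an assumed list, not the scanner's).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def lint_robots(text):
    """Return (line_number, message) pairs for lines that do not validate."""
    problems = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue  # blank lines are fine
        field, sep, _value = line.partition(":")
        if not sep:
            problems.append((number, "missing ':' separator"))
            continue
        name = field.strip().lower()
        if name not in KNOWN_FIELDS:
            problems.append((number, f"`{name}` is not a known field"))
    return problems


# Hypothetical input; the real file's invalid lines are not in the report.
SAMPLE = """\
user agent: *
crawl-delay: 10
"""

for number, message in lint_robots(SAMPLE):
    print(f"line {number}: {message}")
```

In this sample the `crawl-delay` line itself is valid, but because the preceding `user agent` line is rejected, no group is opened for it. That would be consistent with `crawl-delay` appearing under Other Records rather than inside a group, though the report does not confirm this is the cause.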