appisgreat.com
robots.txt

Robots Exclusion Standard data for appisgreat.com

Resource Scan

Scan Details

Site Domain appisgreat.com
Base Domain appisgreat.com
Scan Status Ok
Last Scan2024-06-11T17:06:38+00:00
Next Scan 2024-06-18T17:06:38+00:00

Last Scan

Scanned2024-06-11T17:06:38+00:00
URL https://appisgreat.com/robots.txt
Redirect https://www.appisgreat.com/robots.txt
Redirect Domain www.appisgreat.com
Redirect Base appisgreat.com
Domain IPs 104.18.16.125, 104.18.17.125, 2606:4700::6812:107d, 2606:4700::6812:117d
Redirect IPs 104.18.16.125, 104.18.17.125, 2606:4700::6812:107d, 2606:4700::6812:117d
Response IP 104.18.16.125
Found Yes
Hash 271df197aacc0ffdefe0e8baa29df3e43dd7b2b8a5aee0133baea0a82c5f3ba7
SimHash 087cc8a07b10

Groups

*

Rule Path
Disallow /*about.html$
Disallow /*contact.html$
Disallow /*guidance.html$
Disallow /*disclaimer.html$
Disallow /*dmca.html$
Disallow /*privacyPolicy.html$
Disallow /*rules.html$
Disallow /search
Disallow /*/search

baiduspider*
sogou*

Rule Path
Disallow /

scrapy
semrushbot
ahrefsbot
blexbot
ccbot
cliqzbot
dotbot
ia_archiver
mbcrawler
mj12bot
photon
linguee

Rule Path
Disallow /