papers.co
robots.txt

Robots Exclusion Standard data for papers.co

Resource Scan

Scan Details

Site Domain papers.co
Base Domain papers.co
Scan Status Ok
Last Scan2024-10-07T06:48:35+00:00
Next Scan 2024-10-14T06:48:35+00:00

Last Scan

Scanned2024-10-07T06:48:35+00:00
URL https://papers.co/robots.txt
Domain IPs 104.26.12.187, 104.26.13.187, 172.67.68.102, 2606:4700:20::681a:cbb, 2606:4700:20::681a:dbb, 2606:4700:20::ac43:4466
Response IP 172.67.68.102
Found Yes
Hash 6571cdbe92feca80439370ee5a2d2870a5fe1505138de7016ad5098c0db630fc
SimHash 651d52f2c081

Groups

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20

googlebot-mobile

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

bingbot

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

twitterbot

Rule Path
Allow /

yandex

Rule Path
Disallow /

seznambot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

applebot

Rule Path
Allow /

searchmetricsbot

Rule Path
Allow /

semrushbot

Rule Path
Allow /

semrushbot

Rule Path
Allow /Googlebot-Image

*

Rule Path
Disallow /

Other Records

Field Value
sitemap http://papers.co/sitemap.xml
sitemap http://papers.co/sitemap-image.xml