Robots Exclusion Standard data for innastrona.pl

Resource Scan

Scan Details

Site Domain innastrona.pl
Base Domain innastrona.pl
Scan Status Ok
Last Scan 2024-09-25T21:59:26+00:00
Next Scan 2024-10-02T21:59:26+00:00

Last Scan

Scanned 2024-09-25T21:59:26+00:00
URL http://innastrona.pl/robots.txt
Redirect https://queer.pl/robots.txt
Redirect Domain queer.pl
Redirect Base queer.pl
Domain IPs 116.202.3.13
Redirect IPs 116.202.3.13
Response IP 116.202.3.13
Found Yes
Hash 4f932460d4cfc5bb7386a713c2010d0f8ea0109ae1769dbfa301f649defada98
SimHash 791cc8e2e0b6

Groups

*

Rule Path
Allow /data/article/
Allow /data/rssPost/
Allow /data/wiki/
Allow /data/top/
Disallow /cache/
Disallow /tmp/
Disallow /app/
Disallow /admin/
Disallow /private/
Disallow /test/
Disallow /lib/
Disallow /msg/
Disallow /epub/
Disallow /ebook/
Disallow /user/
Disallow /data/pic/
Disallow /info/terms
Disallow /info/privacy
Disallow /info/cookies
Disallow /info/partners

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
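
The groups above can be checked programmatically. The following is a minimal sketch, not the scanner's own method: it rebuilds a robots.txt body from the rules listed in this report (the original file text is not shown here, so the exact layout is an assumption) and queries it with Python's standard urllib.robotparser. Only four of the crawler-specific groups are included to keep the example short, and the helper names build_robots_txt and make_parser are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the "*" group in the report above.
WILDCARD_RULES = [
    ("Allow", "/data/article/"),
    ("Allow", "/data/rssPost/"),
    ("Allow", "/data/wiki/"),
    ("Allow", "/data/top/"),
    ("Disallow", "/cache/"),
    ("Disallow", "/tmp/"),
    ("Disallow", "/app/"),
    ("Disallow", "/admin/"),
    ("Disallow", "/private/"),
    ("Disallow", "/test/"),
    ("Disallow", "/lib/"),
    ("Disallow", "/msg/"),
    ("Disallow", "/epub/"),
    ("Disallow", "/ebook/"),
    ("Disallow", "/user/"),
    ("Disallow", "/data/pic/"),
    ("Disallow", "/info/terms"),
    ("Disallow", "/info/privacy"),
    ("Disallow", "/info/cookies"),
    ("Disallow", "/info/partners"),
]

# A subset of the crawler-specific groups; each one disallows the whole site.
BLOCKED_AGENTS = ["gptbot", "chatgpt-user", "claudebot", "ccbot"]


def build_robots_txt() -> str:
    """Reconstruct a robots.txt body from the rules above (layout is assumed)."""
    lines = ["User-agent: *"]
    lines += [f"{rule}: {path}" for rule, path in WILDCARD_RULES]
    for agent in BLOCKED_AGENTS:
        lines += ["", f"User-agent: {agent}", "Disallow: /"]
    lines += ["", "Sitemap: https://queer.pl/sitemap"]
    return "\n".join(lines) + "\n"


def make_parser() -> RobotFileParser:
    """Parse the reconstructed text offline; no network request is made."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


if __name__ == "__main__":
    parser = make_parser()
    checks = [
        ("SomeCrawler", "/data/article/123"),  # hypothetical agent name
        ("SomeCrawler", "/admin/login"),
        ("SomeCrawler", "/data/pic/photo.jpg"),
        ("gptbot", "/data/article/123"),
    ]
    for agent, path in checks:
        print(agent, path, parser.can_fetch(agent, path))
```

Note that urllib.robotparser applies the first matching rule in file order rather than the longest match. Because none of the Allow and Disallow paths in the `*` group overlap, both strategies give the same answers for this rule set.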

Other Records

Field Value
sitemap https://queer.pl/sitemap