trigger-the-press.com
robots.txt

Robots Exclusion Standard data for trigger-the-press.com

Resource Scan

Scan Details

Site Domain trigger-the-press.com
Base Domain trigger-the-press.com
Scan Status Ok
Last Scan2024-09-17T18:56:34+00:00
Next Scan 2024-09-24T18:56:34+00:00

Last Scan

Scanned2024-09-17T18:56:34+00:00
URL https://trigger-the-press.com/robots.txt
Domain IPs 151.101.131.7, 151.101.195.7, 151.101.3.7, 151.101.67.7, 2a04:4e42:200::775, 2a04:4e42:400::775, 2a04:4e42:600::775, 2a04:4e42::775
Response IP 199.232.47.7
Found Yes
Hash eb7464993719c5a6829575049c507f6f9627f7e1a6d8e4f638f237e90ceccfe6
SimHash 49144151cd92

Groups

*

Rule Path
Disallow /ghost/
Disallow /email/
Disallow /members/api/comments/counts/
Disallow /r/
Disallow /webmentions/receive/

twitterbot

Rule Path
Disallow /amp/*

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

https://hada.news

Rule Path
Disallow /

https://www.imediaethics.org

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

turnitin

Rule Path
Disallow /

youbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yandex

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://trigger-the-press.com/sitemap.xml
sitemap https://trigger-the-press.com/sitemap-pages.xml
sitemap https://trigger-the-press.com/sitemap-posts.xml
sitemap https://trigger-the-press.com/sitemap-tags.xml