norecipes.com
robots.txt

Robots Exclusion Standard data for norecipes.com

Resource Scan

Scan Details

Site Domain norecipes.com
Base Domain norecipes.com
Scan Status Ok
Last Scan 2024-09-24T22:24:57+00:00
Next Scan 2024-10-01T22:24:57+00:00

Last Scan

Scanned 2024-09-24T22:24:57+00:00
URL https://norecipes.com/robots.txt
Domain IPs 104.18.37.69, 172.64.150.187, 2606:4700:4400::6812:2545, 2606:4700:4400::ac40:96bb
Response IP 104.18.37.69
Found Yes
Hash 380953140696dbb41bb7bf7c97ae8e227d43a76e9f3d89c16f38f38411c35901
SimHash f20859818291

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /page/
Disallow /check-your-email/
Disallow /thanks-email-submitted/
Disallow /thanks-email-confirmed/

amazonbot
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
friendlycrawler
gptbot
icc-crawler
imagesiftbot
meta-externalagent
meta-externalfetcher
oai-searchbot
petalbot
scrapy
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
facebookexternalhit
img2dataset
omgili
omgilibot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://norecipes.com/sitemap_index.xml

Comments

  • Block AI https://github.com/ai-robots-txt/ai.robots.txt/blob/main/robots.txt
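
The groups above can be checked programmatically. The following is a minimal sketch, not the site's actual file: it rebuilds a robots.txt from the scanned records (the exact text, ordering, and whitespace of the original are assumptions) and evaluates it with Python's standard urllib.robotparser. The names reconstruct and allowed, the agent "ExampleBrowser", and the path "/some-recipe/" are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Paths disallowed for the "*" group, as listed in the scan.
GENERAL_DISALLOWS = [
    "/wp-admin/", "/page/", "/check-your-email/",
    "/thanks-email-submitted/", "/thanks-email-confirmed/",
]

# User agents in the second group, which is disallowed from "/".
AI_AGENTS = [
    "amazonbot", "bytespider", "ccbot", "chatgpt-user", "claude-web",
    "claudebot", "diffbot", "facebookbot", "friendlycrawler", "gptbot",
    "icc-crawler", "imagesiftbot", "meta-externalagent",
    "meta-externalfetcher", "oai-searchbot", "petalbot", "scrapy",
    "timpibot", "velenpublicwebcrawler", "webzio-extended", "youbot",
    "anthropic-ai", "cohere-ai", "facebookexternalhit", "img2dataset",
    "omgili", "omgilibot",
]

SITEMAP = "https://norecipes.com/sitemap_index.xml"


def reconstruct() -> str:
    """Rebuild an approximate robots.txt from the scanned records (assumed layout)."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in GENERAL_DISALLOWS]
    lines.append("")
    lines += [f"User-agent: {agent}" for agent in AI_AGENTS]
    lines.append("Disallow: /")
    lines.append("")
    lines.append(f"Sitemap: {SITEMAP}")
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(reconstruct().splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the reconstructed rules let `agent` fetch `path`."""
    return parser.can_fetch(agent, "https://norecipes.com" + path)


if __name__ == "__main__":
    for agent, path in [
        ("ExampleBrowser", "/some-recipe/"),  # hypothetical agent and path
        ("ExampleBrowser", "/wp-admin/"),
        ("GPTBot", "/some-recipe/"),
    ]:
        print(agent, path, allowed(agent, path))
    print(parser.site_maps())
```

Note that urllib.robotparser matches user agents case-insensitively by substring, so real crawlers may interpret the same rules slightly differently.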