emojiency.com
robots.txt

Robots Exclusion Standard data for emojiency.com

Resource Scan

Scan Details

Site Domain emojiency.com
Base Domain emojiency.com
Scan Status Ok
Last Scan 2025-10-23T23:59:13+00:00
Next Scan 2025-10-24T23:59:13+00:00

Last Scan

Scanned 2025-10-23T23:59:13+00:00
URL https://emojiency.com/robots.txt
Redirect https://www.emojiency.com/robots.txt
Redirect Domain www.emojiency.com
Redirect Base emojiency.com
Domain IPs 2a00:1098:80::8:1, 93.93.135.80
Redirect IPs 2a00:1098:80::8:1, 93.93.135.80
Response IP 93.93.135.80
Found Yes
Hash c056b5999186e737377282076e3fe2ce15a743107fdd916ba10d9cc63019df16
SimHash 5b2c4910cdf7
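
The Hash field above can be used to tell whether a later copy of the file is byte-identical to the scanned one. The report does not name the hash algorithm. The sketch below assumes SHA-256 over the raw response body, which is only a guess based on the 64-hex-digit length. The helper names `body_digest` and `matches_report` are hypothetical.

```python
import hashlib

# Value copied from the "Hash" field of the report.
REPORTED_HASH = "c056b5999186e737377282076e3fe2ce15a743107fdd916ba10d9cc63019df16"


def body_digest(body: bytes) -> str:
    """Return the hex digest of the raw robots.txt bytes (ASSUMPTION: SHA-256)."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """True if the body hashes to the value recorded in the report."""
    return body_digest(body) == REPORTED_HASH
```

If the scanner hashes a normalized form of the file, or uses a different algorithm, the comparison will fail even for an unchanged file, so a mismatch alone is not proof that the file changed.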

Groups

*

Rule Path
Allow /
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin/

google-extended

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

claudebot

Rule Path
Allow /

ccbot

Rule Path
Allow /
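
The groups above can be evaluated with a small matcher. This is a minimal sketch, assuming longest-match precedence as described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie) and exact, case-insensitive matching of the user-agent token. The names `GROUPS`, `select_group`, and `is_allowed` are hypothetical, and wildcard patterns (`*`, `$`) are not handled because none appear in this file.

```python
# Groups as listed in the report: user-agent token -> ordered (rule, path) pairs.
GROUPS = {
    "*": [
        ("allow", "/"),
        ("disallow", "/wp-admin/"),
        ("disallow", "/wp-includes/"),
        ("disallow", "/cgi-bin/"),
    ],
    "google-extended": [("allow", "/")],
    "applebot-extended": [("allow", "/")],
    "claudebot": [("allow", "/")],
    "ccbot": [("allow", "/")],
}


def select_group(user_agent: str):
    """Use the group named for this agent if one exists, else the '*' group."""
    return GROUPS.get(user_agent.lower(), GROUPS["*"])


def is_allowed(user_agent: str, path: str) -> bool:
    """Longest matching prefix wins; on equal length, Allow beats Disallow."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, prefix in select_group(user_agent):
        if path.startswith(prefix):
            n = len(prefix)
            if n > best_len or (n == best_len and kind == "allow"):
                best_len, allowed = n, (kind == "allow")
    return allowed


print(is_allowed("somebot", "/wp-admin/options.php"))    # False
print(is_allowed("somebot", "/blog/"))                   # True
print(is_allowed("claudebot", "/wp-admin/options.php"))  # True
```

The last call shows a consequence of group selection: a crawler with its own group follows only that group, so under this reading the Disallow rules in the `*` group do not apply to google-extended, applebot-extended, claudebot, or ccbot. Parsers that apply rules in file order instead of by longest match may also treat the leading `Allow /` in the `*` group as overriding the Disallow lines after it.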

Other Records

Field Value
sitemap https://www.emojiency.com/sitemap.xml

Comments

  • robots.txt — short + AI opt-in
  • Allow all crawlers, block only sensitive/system paths.
  • Explicitly opt in to AI training crawlers that require it.
  • Explicit AI/LLM training opt-ins
  • Sitemap