emmeti.com
robots.txt

Robots Exclusion Standard data for emmeti.com

Resource Scan

Scan Details

Site Domain emmeti.com
Base Domain emmeti.com
Scan Status Ok
Last Scan2025-02-11T13:12:38+00:00
Next Scan 2025-03-13T13:12:38+00:00

Last Scan

Scanned2025-02-11T13:12:38+00:00
URL https://emmeti.com/robots.txt
Domain IPs 104.18.10.93, 104.18.11.93, 2606:4700::6812:a5d, 2606:4700::6812:b5d
Response IP 104.18.10.93
Found Yes
Hash b38e971f89553b86dfbf60fa64916ddaa763f765a4743ddc67e59ea935b756c7
SimHash e9550855c541

Groups

*

Rule Path
Disallow /*?
Disallow /*.json$

ia_archiver

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

googlebot

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap https://emmeti.com/sitemap.xml

Comments

  • Removing Documents From the Wayback Machine archive.org
  • Disable *.domaintools.com crawler
  • Crawler GoogleBot