wearethemis.com
robots.txt

Robots Exclusion Standard data for wearethemis.com

Resource Scan

Scan Details

Site Domain wearethemis.com
Base Domain wearethemis.com
Scan Status Ok
Last Scan 2024-06-10T11:02:23+00:00
Next Scan 2024-07-10T11:02:23+00:00

Last Scan

Scanned 2024-06-10T11:02:23+00:00
URL https://wearethemis.com/robots.txt
Domain IPs 104.21.45.48, 172.67.209.180, 2606:4700:3030::6815:2d30, 2606:4700:3031::ac43:d1b4
Response IP 104.21.45.48
Found Yes
Hash 6209a4a269f62819c6642dfea77390960036b1a7101d01ae28c949796a828ad0
SimHash 3a13e1472d31

Groups

*

Rule Path
Disallow /plugins/

Other Records

Field Value
sitemap https://www.wearethemis.com/sitemap

Comments

  • To add a comment to the file, start the line with the # character.
  • User-Agent is used to target a particular web crawler.
  • Any rules declared below it will apply to that User-Agent.
  • To hide a file or folder from the User-Agent, type the word 'Disallow' followed by a colon and the path to hide (for example, Disallow: /plugins/).
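
The group, sitemap record, and comment notes reported above can be checked with Python's standard urllib.robotparser module. This is a minimal sketch, not the scanner's own method. The robots.txt text is reconstructed from the scan fields, because the report does not show the file's exact bytes. The crawler name "ExampleBot" and the two test URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; the real file's exact text may differ.
ROBOTS_TXT = """\
# Comment lines start with the # character.
User-agent: *
Disallow: /plugins/

Sitemap: https://www.wearethemis.com/sitemap
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" and both paths are hypothetical, chosen only for illustration.
blocked = parser.can_fetch("ExampleBot", "https://wearethemis.com/plugins/example.js")
allowed = parser.can_fetch("ExampleBot", "https://wearethemis.com/about")

# site_maps() returns the Sitemap records found in the file (Python 3.8+).
sitemaps = parser.site_maps()

print(blocked, allowed, sitemaps)
```

Because the only group targets `*`, the Disallow rule applies to every crawler that has no more specific group, so any user-agent string passed to can_fetch gives the same result here.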