gselectronic.ir
robots.txt

Robots Exclusion Standard data for gselectronic.ir

Resource Scan

Scan Details

Site Domain gselectronic.ir
Base Domain gselectronic.ir
Scan Status Ok
Last Scan 2024-06-24T09:38:40+00:00
Next Scan 2024-07-08T09:38:40+00:00

Last Scan

Scanned 2024-06-24T09:38:40+00:00
URL https://gselectronic.ir/robots.txt
Domain IPs 88.135.68.85
Response IP 88.135.68.85
Found Yes
Hash 18554ebce02c3daab28db53c0fc6f146564b235db6d877a818a34c675478c0c6
SimHash 5a1cc8c0a153

Groups

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://theintercept.com/sitemap_index.xml
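
The groups and the sitemap record above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is reconstructed from the listed groups, so its exact layout (ordering, blank lines, capitalization) is an assumption rather than the file as served, and the "ExampleCrawler" user agent is a hypothetical name used only to exercise the catch-all group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report; the original file's
# exact layout and comments are assumptions.
ROBOTS_TXT = """\
User-agent: amazonbot
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: *
Disallow:

Sitemap: https://theintercept.com/sitemap_index.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
home = "https://gselectronic.ir/"

# Named AI crawlers are blocked from the whole site.
print(parser.can_fetch("gptbot", home))          # False

# "ExampleCrawler" is a hypothetical agent; it falls through to the "*"
# group, where an empty Disallow path means nothing is blocked.
print(parser.can_fetch("ExampleCrawler", home))  # True

# Sitemap records are not tied to any group (site_maps() needs Python 3.8+).
print(parser.site_maps())
```

Note that an empty Disallow value, as in the * group, permits everything, whereas Disallow / blocks the entire site for the named agent.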

Comments

  • Start Block AI Crawlers
  • End Block AI Crawlers
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK