linuxgenie.net
robots.txt

Robots Exclusion Standard data for linuxgenie.net

Resource Scan

Scan Details

Site Domain linuxgenie.net
Base Domain linuxgenie.net
Scan Status Ok
Last Scan 2025-11-06T07:00:49+00:00
Next Scan 2025-11-13T07:00:49+00:00

Last Scan

Scanned 2025-11-06T07:00:49+00:00
URL https://linuxgenie.net/robots.txt
Domain IPs 104.21.28.166, 172.67.170.242, 2606:4700:3033::ac43:aaf2, 2606:4700:3036::6815:1ca6
Response IP 104.21.28.166
Found Yes
Hash a7ec2f820bcd6521f550b38492dca5acbd5c7ab6dc0e5a7457ab4bcafbf9802b
SimHash 5964d8c0a193

Groups

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://linuxgenie.net/sitemap_index.xml
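
The groups and sitemap record above can be checked with a minimal sketch using Python's standard urllib.robotparser. The robots.txt text here is rebuilt from the listed rules and is an assumption: the original file's ordering, casing, and comment layout are not shown in this report, so the rebuilt text will not reproduce the hash listed earlier. The names BLOCKED_AGENTS, build_robots_txt, and is_allowed, and the sample page URL, are hypothetical and used only for illustration. The empty Disallow in the * group is treated as "nothing is disallowed".

```python
from urllib.robotparser import RobotFileParser

# User agents that the report lists with "Disallow /".
BLOCKED_AGENTS = [
    "chatgpt-user",
    "gptbot",
    "anthropic-ai",
    "claude-web",
    "piplbot",
    "ccbot",
    "facebookbot",
]

SITEMAP_URL = "https://linuxgenie.net/sitemap_index.xml"


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the groups in the report.

    This is an assumption-based reconstruction, not the site's actual file.
    """
    lines = []
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # The catch-all group has an empty Disallow, which blocks nothing.
    lines += ["User-agent: *", "Disallow:", ""]
    lines.append(f"Sitemap: {SITEMAP_URL}")
    return "\n".join(lines)


# Parse the reconstructed text locally; no network request is made.
parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())


def is_allowed(agent: str, url: str = "https://linuxgenie.net/some-page/") -> bool:
    """Return True if the given user agent may fetch the (hypothetical) URL."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in BLOCKED_AGENTS + ["Googlebot"]:
        status = "allowed" if is_allowed(agent) else "blocked"
        print(f"{agent}: {status}")
    print("Sitemaps:", parser.site_maps())
```

Under these assumptions, each of the seven named crawlers is refused for every path, while any other user agent falls through to the * group and is allowed. RobotFileParser.site_maps() requires Python 3.8 or later.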

Comments

  • ======Raptive Begin======
  • ======Raptive End======
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK