tnl.net
robots.txt

Robots Exclusion Standard data for tnl.net

Resource Scan

Scan Details

Site Domain tnl.net
Base Domain tnl.net
Scan Status Ok
Last Scan2024-05-14T09:16:13+00:00
Next Scan 2024-06-13T09:16:13+00:00

Last Scan

Scanned2024-05-14T09:16:13+00:00
URL https://tnl.net/robots.txt
Domain IPs 104.21.74.12, 172.67.152.174, 2606:4700:3034::6815:4a0c, 2606:4700:3036::ac43:98ae
Response IP 104.21.74.12
Found Yes
Hash 4498f469a6a30416dc92a654611929a64d7b8e6a7e39651194bbb10ca26652b9
SimHash 48250884a0b2

Groups

*

Rule Path
Disallow /wp-admin/

ccbot

Rule Path
Disallow /blog/

gptbot

Rule Path
Disallow /blog/

omgili

Rule Path
Disallow /blog/

omgilibot

Rule Path
Disallow /blog/

*

Rule Path
Disallow

adsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://tnl.net/sitemap_index.xml

Comments

  • SECURITY DISALLOW
  • ---------------------------
  • START DISALLOW LLM CRAWLERS
  • ---------------------------
  • END DISALLOW LLM CRAWLERS
  • ---------------------------
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK