twsn.net
robots.txt

Robots Exclusion Standard data for twsn.net

Resource Scan

Scan Details

Site Domain twsn.net
Base Domain twsn.net
Scan Status Ok
Last Scan2024-11-13T13:18:29+00:00
Next Scan 2024-11-20T13:18:29+00:00

Last Scan

Scanned2024-11-13T13:18:29+00:00
URL https://twsn.net/robots.txt
Domain IPs 192.0.66.88
Response IP 192.0.66.88
Found Yes
Hash ef72a989924640631bf2d239a80f2a209b1fd00e93d236f0c03770bf4cad6c78
SimHash 69248a60c9b2

Groups

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://twsn.net/sitemap.xml
sitemap https://twsn.net/news-sitemap.xml