thuthuataccess.com
robots.txt

Robots Exclusion Standard data for thuthuataccess.com

Resource Scan

Scan Details

Site Domain thuthuataccess.com
Base Domain thuthuataccess.com
Scan Status Ok
Last Scan2024-09-30T18:13:42+00:00
Next Scan 2024-10-07T18:13:42+00:00

Last Scan

Scanned2024-09-30T18:13:42+00:00
URL https://thuthuataccess.com/robots.txt
Domain IPs 123.30.182.67
Response IP 123.30.182.67
Found Yes
Hash 04a578287d919e05b5aff829d792f6b0a08819446c26c39eae04ca32f2d52bdf
SimHash 381598248373

Groups

*

Rule Path
Disallow /forum/admin/
Disallow /forum/archive/
Disallow /forum/nimda/
Disallow /SQLDumper/
Disallow /forum/captcha.php
Disallow /forum/editpost.php
Disallow /forum/misc.php
Disallow /forum/modcp.php
Disallow /forum/moderation.php
Disallow /forum/newreply.php
Disallow /forum/newthread.php
Disallow /forum/private.php
Disallow /forum/ratethread.php
Disallow /forum/report.php
Disallow /forum/sendthread.php
Disallow /forum/task.php
Disallow /forum/usercp.php
Disallow /forum/usercp2.php
Disallow /forum/calendar.php?action=addevent
Disallow /forum/printthread.php
Allow /
Allow /forum/

coccoc

Rule Path
Disallow /forum/

baiduspider

Rule Path
Disallow /forum/

yandexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://thuthuataccess.com/forum/sitemap-index.xml
sitemap http://thuthuataccess.com/forum/tagsitemap.xml