arclic.top
robots.txt

Robots Exclusion Standard data for arclic.top

Resource Scan

Scan Details

Site Domain arclic.top
Base Domain arclic.top
Scan Status Ok
Last Scan 2025-11-27T06:11:57+00:00
Next Scan 2025-12-04T06:11:57+00:00

Last Scan

Scanned 2025-11-27T06:11:57+00:00
URL https://arclic.top/robots.txt
Domain IPs 104.21.39.141, 172.67.146.27, 2606:4700:3033::ac43:921b, 2606:4700:3036::6815:278d
Response IP 104.21.39.141
Found Yes
Hash e5d71a5759711d77c72050b3d9106da9ccd32e9d16b1b0e46f8783ba006504bc
SimHash 493d4ef4f603
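
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched robots.txt body, although the report does not name the algorithm. The following is a minimal sketch under that assumption; the names body_digest and unchanged are hypothetical and not part of the scanner.

```python
import hashlib

# Digest recorded in the scan above.
REPORTED_HASH = "e5d71a5759711d77c72050b3d9106da9ccd32e9d16b1b0e46f8783ba006504bc"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw response body.

    SHA-256 is an assumption based on the digest length only.
    """
    return hashlib.sha256(body).hexdigest()


def unchanged(body: bytes) -> bool:
    """Report whether a freshly fetched body matches the recorded digest."""
    return body_digest(body) == REPORTED_HASH
```

Comparing digests this way only detects byte-for-byte changes. The SimHash field presumably serves the looser purpose of flagging near-duplicate content, and it is not reproduced here.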

Groups

*

Rule Path
Disallow /admin/
Disallow /login/
Disallow /api/
Disallow /account/
Disallow /wp-admin/
Disallow /cgi-bin/
Disallow /tmp/
Disallow /private/

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /
Disallow /wp-content/uploads/private/

mediapartners-google

Rule Path
Allow /

google-adsbot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

Other Records

Field Value
sitemap https://arclic.top/sitemap.xml
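
The groups and the sitemap record above can be checked programmatically. The sketch below rebuilds a robots.txt from the listed rules and queries it with Python's standard urllib.robotparser. The rebuilt text is an assumption: the original file's exact wording, ordering, and comments are not shown in this report. The agent name ExampleBot and the helper allowed are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Rebuilt from the groups listed in this report (assumed layout, not the
# original file's exact bytes).
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /login/
Disallow: /api/
Disallow: /account/
Disallow: /wp-admin/
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /private/

User-agent: googlebot
Allow: /

User-agent: googlebot-image
Allow: /
Disallow: /wp-content/uploads/private/

User-agent: mediapartners-google
Allow: /

User-agent: google-adsbot
Allow: /

User-agent: googlebot-mobile
Allow: /

Sitemap: https://arclic.top/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the given agent may fetch the path on arclic.top."""
    return parser.can_fetch(agent, "https://arclic.top" + path)


# A crawler with no dedicated group falls back to the "*" group.
print(allowed("ExampleBot", "/admin/"))   # blocked by the "*" group
print(allowed("ExampleBot", "/"))         # not covered by any Disallow rule
# googlebot has its own group, so the "*" rules do not apply to it.
print(allowed("googlebot", "/admin/"))
print(parser.site_maps())
```

Note that urllib.robotparser applies the first matching rule in a group and matches agent names by substring, whereas RFC 9309 specifies the most specific (longest) matching rule. Results for the googlebot-image group, where Allow / precedes a narrower Disallow, may therefore differ between this parser and a crawler that follows the RFC.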