smartasians.net
robots.txt

Robots Exclusion Standard data for smartasians.net

Resource Scan

Scan Details

Site Domain smartasians.net
Base Domain smartasians.net
Scan Status Ok
Last Scan2025-11-07T11:54:07+00:00
Next Scan 2025-12-07T11:54:07+00:00

Last Scan

Scanned2025-11-07T11:54:07+00:00
URL https://smartasians.net/robots.txt
Domain IPs 104.21.43.175, 172.67.182.163, 2606:4700:3031::ac43:b6a3, 2606:4700:3032::6815:2baf
Response IP 104.21.43.175
Found Yes
Hash 6ce968184e8b05e7a0bf2884f596edc8ec5988dec1b1bf29269fcc3a4808b4fb
SimHash 7a08586284b6

Groups

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

gemini

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

*

Rule Path
Allow /wp-content/plugins/
Allow /wp-content/themes/
Allow /wp-includes/js/
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php
Allow /wp-content*
Allow /wp-includes*
Allow /*.js
Allow /*.css
Allow /*.jpg
Allow /*.png
Allow /*.svg
Allow /*.gif
Allow /*.jpeg
Allow /*.webp
Disallow /cdn-cgi/l/email-protection
Disallow /wp-login*
Disallow /wp-admin/
Disallow /cgi-bin
Disallow /*?*
Disallow /page*
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow */embed$
Disallow */xmlrpc.php
Disallow *?block
Disallow /out/
Disallow /go/
Disallow /feed/

Other Records

Field Value
sitemap https://smartasians.net/sitemap_index.xml