singtao.ca
robots.txt

Robots Exclusion Standard data for singtao.ca

Resource Scan

Scan Details

Site Domain singtao.ca
Base Domain singtao.ca
Scan Status Ok
Last Scan2024-11-01T17:42:05+00:00
Next Scan 2024-11-08T17:42:05+00:00

Last Scan

Scanned2024-11-01T17:42:05+00:00
URL https://singtao.ca/robots.txt
Redirect https://www.singtao.ca/robots.txt
Redirect Domain www.singtao.ca
Redirect Base singtao.ca
Domain IPs 104.26.14.193, 104.26.15.193, 172.67.69.213, 2606:4700:20::681a:ec1, 2606:4700:20::681a:fc1, 2606:4700:20::ac43:45d5
Redirect IPs 104.26.14.193, 104.26.15.193, 172.67.69.213, 2606:4700:20::681a:ec1, 2606:4700:20::681a:fc1, 2606:4700:20::ac43:45d5
Response IP 104.26.15.193
Found Yes
Hash 6bbe52675c76c1458b8ee19036ee316fefbc026ba600a9e6007b6b4bc813ee3c
SimHash 726ad8d0e413

Groups

googlebot
googlebot-news
mediapartners-google
bingbot
msnbot
slurp
yandex
baiduspider
alexabot
facebot
ia_archiver

Rule Path
Allow /

Other Records

Field Value
crawl-delay 300

sosospider

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/upgrade/
Disallow /wp-content/wflogs/
Disallow /wp-content/rsspi-log/
Disallow /wp-content/languages/
Disallow /dushi/
Disallow /dushi-single/
Disallow /deals/blackfriday/
Disallow /deals/singclub/
Disallow /readme.html
Disallow /refer/
Disallow /?s=
Disallow /search/
Disallow /daily_

Other Records

Field Value
sitemap https://www.singtao.ca/gen-sitemap/post/today/
sitemap https://www.singtao.ca/gen-sitemap/rtnews/today/