
Robots Exclusion Standard data for namaantg.com

Resource Scan

Scan Details

Site Domain namaantg.com
Base Domain namaantg.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-08-24T18:45:32+00:00
Next Scan 2024-11-22T18:45:32+00:00

Last Successful Scan

Scanned 2021-11-26T02:40:54+00:00
URL http://namaantg.com/robots.txt
Found Yes
Hash 18098f5944171581092ef664554f9f0d94c3b2951ad761797d488bca6a680188
SimHash 6854dd834697

Groups

baiduspider

Rule Path
Disallow

baiduspider-image

Rule Path
Disallow

baiduspider-render

Rule Path
Disallow

sosospider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

googlebot-mobile

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow /

psbot

Rule Path
Disallow /

*

Rule Path
Disallow
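
The groups above can be read as follows: an empty Disallow path blocks nothing, while Disallow / blocks the whole site for that user agent. The sketch below is one way to check that reading with Python's standard urllib.robotparser. It rebuilds a small, hypothetical subset of the groups. It is not the original file, which is not reproduced in this report and contained 2 invalid lines. The names GROUPS, build_robots_txt and make_parser are illustrative assumptions.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a few groups from the report above.
# This is NOT the original robots.txt; it only mirrors the listed rules.
GROUPS = {
    "baiduspider": "",  # empty Disallow: nothing is blocked
    "googlebot": "/",   # Disallow: / blocks the whole site
    "bingbot": "/",
    "*": "",            # default group: nothing is blocked
}


def build_robots_txt(groups):
    """Render a {user-agent: disallow-path} mapping as robots.txt text."""
    lines = []
    for agent, path in groups.items():
        lines.append(f"User-agent: {agent}")
        # rstrip() turns "Disallow: " into "Disallow:" for empty paths
        lines.append(f"Disallow: {path}".rstrip())
        lines.append("")  # a blank line ends the group
    return "\n".join(lines)


def make_parser(groups):
    """Parse the rendered rules without making any network request."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt(groups).splitlines())
    return parser


parser = make_parser(GROUPS)
for agent in ("baiduspider", "googlebot", "bingbot", "SomeOtherBot"):
    allowed = parser.can_fetch(agent, "http://namaantg.com/index.html")
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Under these assumptions, baiduspider and any crawler without its own group should be reported as allowed, and googlebot and bingbot as blocked. Other parsers may match user agents differently, so treat the result as indicative only.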

Other Records

Field Value
sitemap /sitemap.xml

Warnings

  • 2 invalid lines.