sigma-4pc.com
robots.txt

Robots Exclusion Standard data for sigma-4pc.com

Resource Scan

Scan Details

Site Domain sigma-4pc.com
Base Domain sigma-4pc.com
Scan Status Ok
Last Scan2024-09-19T14:00:39+00:00
Next Scan 2024-09-26T14:00:39+00:00

Last Scan

Scanned2024-09-19T14:00:39+00:00
URL https://sigma-4pc.com/robots.txt
Domain IPs 104.21.28.12, 172.67.170.37, 2606:4700:3033::6815:1c0c, 2606:4700:3037::ac43:aa25
Response IP 104.21.28.12
Found Yes
Hash e3206b86738f573f599798d347f74da96450a4e99cfa68a099bb8663632fe100
SimHash 69445d800e32

Groups

*

Rule Path
Allow /
Disallow /readme.html
Disallow /wp-admin/
Disallow /modules/
Disallow /modules/*
Disallow /*/modules/
Disallow /*/modules/*
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Disallow /wp-login.php
Disallow /trackback/
Disallow */trackback/
Disallow /*?replytocom
Disallow /*.php$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /*.php*
Disallow /trackback*
Disallow /*.inc$
Disallow /*.txt$

mediapartners-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://sigma-4pc.com/sitemap_index.xml