pitmodule.de
robots.txt

Robots Exclusion Standard data for pitmodule.de

Resource Scan

Scan Details

Site Domain pitmodule.de
Base Domain pitmodule.de
Scan Status Ok
Last Scan 2025-05-27T05:24:16+00:00
Next Scan 2025-06-26T05:24:16+00:00

Last Scan

Scanned 2025-05-27T05:24:16+00:00
URL https://www.pitmodule.de/robots.txt
Domain IPs 80.243.45.141
Response IP 80.243.45.141
Found Yes
Hash a7eec347d52de6079380f7c782606f033edbced9c43a1fe3e7918c855e15d8ab
SimHash 28501a04a352

Groups

*

Rule Path
Disallow /*/adm/
Disallow /*/admin/
Disallow /*/*admin/
Disallow /*/temp/
Disallow /*/tmp/
Disallow /*/*test/
Disallow /*/*_old/
Disallow /*/*_alt/
Disallow /*/_*/
Disallow /*.pdf$
Disallow /*.xml$
Disallow /*.jpg$
Disallow /*.jpeg$
Disallow /*.png$
Disallow /*.gif$
Disallow /*.psd$
Disallow /*.csv$
Disallow /*.xls$
Disallow /*.css$
Disallow /*.js$
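
The paths above rely on two special characters: `*` stands for any sequence of characters and a trailing `$` anchors the rule to the end of the URL path. The following is a minimal sketch of that matching, assuming the wildcard semantics commonly used by crawlers; the names `rule_to_regex`, `is_disallowed`, and `PATTERNS` are illustrative and not part of the scan. It ignores Allow rules and longest-match precedence, since this group contains only Disallow rules.

```python
import re

# Excerpt of the Disallow paths listed for the "*" group.
PATTERNS = ["/*/admin/", "/*/*admin/", "/*/_*/", "/*.pdf$", "/*.js$"]

def rule_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing '$' means the rule must match up to the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and turn every '*' into '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))

def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any pattern matches the path from its start."""
    return any(rule_to_regex(p).match(path) for p in patterns)

if __name__ == "__main__":
    for path in ["/de/admin/index.php", "/files/doc.pdf",
                 "/files/doc.pdf?x=1", "/admin/", "/index.html"]:
        print(path, is_disallowed(path, PATTERNS))
```

Under this sketch, every directory rule starts with `/*/`, so it needs at least one path segment before the named directory: a top-level `/admin/` is not matched by `/*/admin/` or `/*/*admin/`.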

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea

Rule Path
Disallow /
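
The two groups can be read as a lookup: a crawler whose name matches one of the listed agents gets the blanket `Disallow /`, and every other crawler falls back to the `*` group. A rough sketch of that selection follows, assuming a simplified case-insensitive substring match on the User-Agent string; `select_rules`, `GROUPS`, and the sample User-Agent strings are hypothetical and only for illustration.

```python
# Excerpt of the "*" group's Disallow paths.
STAR_RULES = ["/*/adm/", "/*/admin/", "/*.pdf$"]

# Agents named in the second group, which is disallowed everywhere.
BLOCKED_AGENTS = [
    "webreaper", "webcopier", "offline explorer", "httrack",
    "microsoft.url.control", "emailcollector", "penthesilea",
]

GROUPS = [(["*"], STAR_RULES), (BLOCKED_AGENTS, ["/"])]

def select_rules(user_agent: str, groups) -> list[str]:
    """Pick the Disallow list for a crawler, preferring a named group."""
    ua = user_agent.lower()
    fallback: list[str] = []
    for agents, rules in groups:
        for agent in agents:
            if agent == "*":
                fallback = rules      # used only if no named group matches
            elif agent in ua:
                return rules          # a named group wins over "*"
    return fallback

if __name__ == "__main__":
    print(select_rules("HTTrack/3.49", GROUPS))
    print(select_rules("ExampleBot/1.0", GROUPS))
```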

Other Records

Field Value
sitemap https://www.pitmodule.de/sitemap_location.php