technorati.com
robots.txt

Robots Exclusion Standard data for technorati.com

Resource Scan

Scan Details

Site Domain technorati.com
Base Domain technorati.com
Scan Status Ok
Last Scan2024-11-13T17:33:48+00:00
Next Scan 2024-11-20T17:33:48+00:00

Last Scan

Scanned2024-11-13T17:33:48+00:00
URL https://technorati.com/robots.txt
Domain IPs 129.213.208.64
Response IP 129.213.208.64
Found Yes
Hash a1428640660274e3bbd47c67d559a8c618eee77115ab33573f2ce7fc0af9f74b
SimHash 9658f9a62d87

Groups

*

Rule Path
Disallow /google/
Disallow /search/
Disallow /provisioning/
Disallow /library/
Disallow /files/
Disallow /login.php
Disallow /login_proxy.php
Disallow /tv/login.php
Disallow /ajaxapi/login.php
Disallow /hdtv/login.php
Disallow /movies_channel/login.php
Disallow /provisioning/client_login.php
Disallow /provisioning/login.php
Disallow /provisioning2/login.php
Disallow /store/login.php
Disallow /templates/maya/components/login.php
Disallow /templates/maya/components/login_delph.php
Disallow /toolbar2/files/generics/login.php
Disallow /tv_movies/login.php
Disallow /yummy/ilogin.php
Disallow /zmail/zmpage_login.php
Disallow /*?*u_d=
Disallow /outbound/
Disallow /*?*email=
Disallow /*?*e-mail=