proz.com
robots.txt

Robots Exclusion Standard data for proz.com

Resource Scan

Scan Details

Site Domain proz.com
Base Domain proz.com
Scan Status Ok
Last Scan2024-11-13T00:35:00+00:00
Next Scan 2024-11-20T00:35:00+00:00

Last Scan

Scanned2024-11-13T00:35:00+00:00
URL https://proz.com/robots.txt
Redirect http://www.proz.com/robots.txt
Redirect Domain www.proz.com
Redirect Base proz.com
Domain IPs 172.66.40.225, 172.66.43.31, 2606:4700:3108::ac42:28e1, 2606:4700:3108::ac42:2b1f
Redirect IPs 172.66.40.225, 172.66.43.31, 2606:4700:3108::ac42:28e1, 2606:4700:3108::ac42:2b1f
Response IP 172.66.43.31
Found Yes
Hash bb86636ebb438dc79990626c39bfd1f60146309632040030b1251f32320c4c2c
SimHash cf474c94d2d2

Groups

mediapartners-google*

Rule Path
Disallow

petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /crawler-pit/
Disallow /profile$
Disallow /profile/$
Disallow /profile?
Disallow /profile/?
Disallow /translator/2372$
Disallow /profile/127329$
Disallow /?sp=login
Disallow /?sp=404
Disallow /translation-news/wp-admin
Disallow

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://www.proz.com/sitemap__index_Main.xml.gz
sitemap https://www.proz.com/sitemap__index_SoftwareComparisonTool.xml.gz
sitemap https://www.proz.com/sitemap__index_Kudoz.xml.gz
sitemap https://www.proz.com/sitemap__index_Forums.xml.gz
sitemap https://www.proz.com/sitemap__index_Businesses.xml.gz
sitemap https://www.proz.com/sitemap__index_BlueBoard.xml.gz
sitemap https://www.proz.com/sitemap__index_Profiles.xml.gz
sitemap https://www.proz.com/sitemap__index_ClassicalJobPostings.xml.gz
sitemap https://www.proz.com/sitemap__index_TranslatorDirectory.xml.gz