ita.proz.com
robots.txt

Robots Exclusion Standard data for ita.proz.com

Resource Scan

Scan Details

Site Domain ita.proz.com
Base Domain proz.com
Scan Status Ok
Last Scan2025-12-27T04:00:48+00:00
Next Scan 2026-01-26T04:00:48+00:00

Last Scan

Scanned2025-12-27T04:00:48+00:00
URL https://ita.proz.com/robots.txt
Domain IPs 104.26.12.121, 104.26.13.121, 172.67.72.152, 2606:4700:20::681a:c79, 2606:4700:20::681a:d79, 2606:4700:20::ac43:4898
Response IP 104.26.12.121
Found Yes
Hash bb86636ebb438dc79990626c39bfd1f60146309632040030b1251f32320c4c2c
SimHash cf474c94d2d2

Groups

mediapartners-google*

Rule Path
Disallow

petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /crawler-pit/
Disallow /profile$
Disallow /profile/$
Disallow /profile?
Disallow /profile/?
Disallow /translator/2372$
Disallow /profile/127329$
Disallow /?sp=login
Disallow /?sp=404
Disallow /translation-news/wp-admin
Disallow

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://www.proz.com/sitemap__index_Main.xml.gz
sitemap https://www.proz.com/sitemap__index_SoftwareComparisonTool.xml.gz
sitemap https://www.proz.com/sitemap__index_Kudoz.xml.gz
sitemap https://www.proz.com/sitemap__index_Forums.xml.gz
sitemap https://www.proz.com/sitemap__index_Businesses.xml.gz
sitemap https://www.proz.com/sitemap__index_BlueBoard.xml.gz
sitemap https://www.proz.com/sitemap__index_Profiles.xml.gz
sitemap https://www.proz.com/sitemap__index_ClassicalJobPostings.xml.gz
sitemap https://www.proz.com/sitemap__index_TranslatorDirectory.xml.gz