Bias Analysis
Detected Bias Types
powershell_heavy
windows_tools
windows_first
missing_linux_example
Summary
The documentation provides both shell (Linux/Unix) and PowerShell (Windows) examples for uploading files and running Hive/Pig jobs. However, there is a strong emphasis on PowerShell throughout, with extensive PowerShell scripts and troubleshooting steps tailored to Windows users. Windows-specific issues (like CRLF line endings) are discussed in detail, and Windows tools (PowerShell, Azure Cloud Shell) are mentioned frequently. In some sections, PowerShell examples are more detailed or appear before shell examples. There is limited coverage of Linux-specific workflows, and troubleshooting for Linux users is minimal.
Recommendations
- Provide parity in troubleshooting steps for Linux users, such as how to fix line ending issues using Linux tools (e.g., dos2unix, sed).
- Include more detailed Linux shell examples for uploading files and running jobs, matching the depth of PowerShell scripts.
- Ensure that Linux/Unix workflows are presented first or equally alongside Windows/PowerShell workflows.
- Mention Linux tools and editors (e.g., nano, vim, dos2unix) when discussing file editing and preparation.
- Clarify which steps are platform-agnostic and which are platform-specific, and provide alternatives for both environments.
- Add troubleshooting tips for common Linux issues (e.g., file permissions, SSH key problems) relevant to HDInsight usage.
Create Pull Request