### Description

• Help train large language models (LLMs) to write production-grade code across a wide range of programming languages:

• Compare & rank multiple code snippets, explaining which is best and why. 

• Repair & refactor AI-generated code for correctness, efficiency, and style. 

• Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly. 

• The workflow: the model generates code ➜ expert engineers rank, edit, and justify ➜ that feedback is converted into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship.

### Requirements

• 4+ years of professional software engineering experience in one or more of the following: Python, Java, JavaScript, TypeScript, Go, C++, PHP, COBOL, C, Ruby, or Rust.

• Strong code-review instincts—you can spot logic errors, performance traps, and security issues quickly. 

• Extreme attention to detail and excellent written communication skills. 

• You enjoy reading documentation and language specs and thrive in an asynchronous, low-oversight environment.

• No prior RLHF (Reinforcement Learning from Human Feedback) or AI training experience is required.

• No deep machine learning knowledge is required. If you can review and critique code clearly, we’ll teach you the rest.

Python Software Engineer - A
