Job Detail

Reinforcement Learning Infrastructure (Cybersecurity)

Posted 1 day ago
$176.4k–242.6k / year
Remote: United States
Full-Time

We are Bugcrowd, a company that empowers organizations to take back control and stay ahead of threat actors by uniting the collective ingenuity and expertise of our customers and trusted alliance of elite hackers. Our network of hackers brings diverse expertise to uncover hidden weaknesses, adapting swiftly to evolving threats. Visit www.bugcrowd.com to learn more.

Job Summary

The Bugcrowd RL and Reasoning Team focuses on pushing the boundaries of autonomous cybersecurity by building authentic reinforcement learning environments for foundational model companies. As a Staff Engineer, you will advance the frontier of AI Reinforcement Learning development and delivery. You will build the infrastructure and tooling that transforms real-world vulnerability research into large-scale reinforcement learning environments used to train next-generation AI systems.

Essential Duties and Responsibilities

  • Design pipelines that ingest software projects, analyze them with Bugcrowd’s Mayhem platform, and automatically construct training environments used by frontier AI labs including Anthropic, OpenAI, and Cohere.
  • Build high-performance systems that power cutting-edge AI research.
  • The ideal candidate is a strong systems engineer who understands:
  • Reinforcement learning workflows
  • Building clean, reproducible Linux ML environments (containers, MCP, etc)
  • System security background in binary exploitation, such as buffer overflows, fuzzing, exploitation, and x86/64.
  • Experience developing applications in Python and C, with Rust a plus.

Education, Experience, Knowledge, Skills, and Abilities

  • Understanding of RL training workflows used by modern LLM systems
  • Experience with DevOps pipelines (e.g., github actions), reproducible builds (docker, buildkit, nix).
  • Proficiency in Python and C. Other languages (especially Rust) are a plus.
  • Understanding of software vulnerabilities, fuzzing, or program analysis
  • Experience with build systems and large open-source codebases
  • Comfort working with Linux systems and low-level debugging

Working Conditions and Physical Requirements

  • The ideal candidate must be able to complete all physical requirements of the job with or without reasonable accommodation.
  • Sitting and/or standing - Must be able to remain in a stationary position 50% of the time
  • Carrying and/or lifting - Must be able to carry/move laptop as needed throughout the work day.
  • Environment - remote, work-from-home 100% of the time.

ADA Statement

Bugcrowd is committed to the full inclusion of all qualified individuals. In keeping with our commitment, Bugcrowd will take the steps to assure that people with disabilities are provided reasonable accommodations. Please contact HR at ada@bugcrowd.com for more information.

Pay Range Disclosure

The national estimate for the current base range for the position of $176,400 - $242,550. This position may also be eligible to participate in a discretionary bonus program or commission plan.

Culture

At Bugcrowd, we understand that diversity in the workplace is vital to a company’s success and growth. We strive to make sure that people are included and have a sense of being part of making Bugcrowd not only a great product but a great place to work.

Disclaimer

This position has access to highly confidential, sensitive information relating to the technologies of Bugcrowd. It is essential that the applicant possess the requisite integrity to maintain the information in the strictest confidence.

Equal Employment Opportunity

Bugcrowd is EOE, Disability/Age Employer. Individuals seeking employment at Bugcrowd are considered without regards to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation.

Apply at: https://www.bugcrowd.com/about/careers/

Bugcrowd
United States
View company profile
Share this job