AlignmentResearch

FAR.AI

Frontier alignment research to ensure the safe development and deployment of advanced AI systems.

122 followers
https://far.ai
@FARAIResearch
company/far-ai
hello@far.ai

Popular repositories

tuned-lens Public
Tools for understanding how transformer predictions are built layer-by-layer
Python, 463 stars, 48 forks

go_attack Public
Python, 84 stars, 7 forks

vlmrm Public
Python, 47 stars, 12 forks

gpt-4-novel-apis-attacks Public
19 stars, 1 fork

learned-planner Public
Interpretability tools for recurrent networks that play Sokoban
Python, 10 stars, 2 forks

scaling-poisoning Public
Python, 6 stars

Repositories

Showing 10 of 34 repositories

HarmBench Public Forked from centerforaisafety/HarmBench
Fork of HarmBench for getting R2D2 working

Jupyter Notebook 0 MIT 64 0 0 Updated Feb 1, 2025
train-learned-planner Public
Experimenting with CleanRL for learned-planners

Python 4 0 1 2 Updated Jan 31, 2025
KataGoVisualizer Public

HTML 3 MIT 1 6 0 Updated Jan 29, 2025
gym-sokoban Public
Sokoban environment for Gym

Python 0 MIT 0 0 0 Updated Jan 27, 2025
oauth2-proxy-buildpack Public
Fork of https://github.com/cfra/heroku-buildpack-oauth2-proxy for internal use

Shell 0 Apache-2.0 0 0 0 Updated Jan 27, 2025
farconf Public
Easy dataclass-based configuration for ML projects

Python 1 0 0 0 Updated Jan 17, 2025
go_attack Public

Python 84 MIT 7 12 0 Updated Jan 15, 2025
KataGo-custom Public
Child repository of https://github.com/HumanCompatibleAI/go_attack.

C++ 5 1 6 0 Updated Jan 15, 2025
envpool Public Forked from sail-sg/envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 0 Apache-2.0 111 0 0 Updated Jan 15, 2025
learned-planner Public
Interpretability tools for recurrent networks that play Sokoban

Python 10 Apache-2.0 2 0 0 Updated Jan 15, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.
