Community Hub

Connect, collaborate, and learn with the AI safety community

Interpretability Research Circle

47 members • High activity

Focused on developing and sharing techniques for understanding neural network internals, with emphasis on language models.

InterpretabilityLanguage ModelsNeural Networks
Last active: 2 hours ago

AI Governance Working Group

38 members • Medium activity

Discussing policy approaches, regulatory frameworks, and governance structures for advanced AI systems.

GovernancePolicyRegulation
Last active: 1 day ago

Alignment Theory Discussion

52 members • High activity

Exploring theoretical approaches to aligning AI systems with human values and intentions.

AlignmentValue LearningTheoretical Approaches
Last active: 5 hours ago

AI Safety Beginners

124 members • Very High activity

Support group for newcomers to the field. Ask questions, share resources, and build foundational knowledge.

EducationCareer DevelopmentFundamentals
Last active: 20 minutes ago

ML Safety Engineering

35 members • Medium activity

Technical discussions on implementing safety measures in machine learning systems. Code sharing and practical approaches.

ML EngineeringRobustnessTesting
Last active: 2 days ago

AI Safety Reading Group

63 members • Medium activity

Weekly discussions of key papers and books in AI safety. Current focus: mechanistic interpretability literature.

Literature ReviewResearch DiscussionInterpretability
Last active: 3 days ago