Ethics Advanced Quiz 2

Select your answers and check your results. Use Reset to start again.
Practice Pronunciation (Merriam-Webster)
Navigation
Advanced Quiz 2
1. What is adversarial robustness testing?
2. How can red-teaming help surface safety issues in powerful AI systems?
3. What is scalable oversight?
4. How can weak-to-strong generalization support scalable oversight?
5. What is reinforcement learning from human feedback (RLHF)?
6. What is a limitation of RLHF for alignment?
7. What does reward modeling try to achieve in AI safety?
8. How can AI systems be trained to express uncertainty?
9. What is impact regularization in AI safety?
10. What is corrigibility in the context of AI?
Previous Next
Other
Timer
00:00

Vocabulary Quiz
Score: 0

Spin the Wheel
Promo's

Explore More