• A_A@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    ·
    5 个月前

    Looks quite satisfying to me, otherwise, we can still create new tests … :

    The tests cover an astounding range of knowledge, such as eighth-grade math, world history, and pop culture. Many are multiple choice, others take free-form answers. Some purport to measure knowledge of advanced fields like law, medicine and science. Others are more abstract, asking AI systems to choose the next logical step in a sequence of events, or to review “moral scenarios” and decide what actions would be considered acceptable behavior in society today.