June 26th, 2024

AI can beat real university students in exams, study suggests

A study from the University of Reading reveals AI outperforms real students in exams. AI-generated answers scored higher, raising concerns about cheating. Researchers urge educators to address AI's impact on assessments.

Read original article

AI can beat real university students in exams, study suggests

A study conducted by the University of Reading suggests that artificial intelligence (AI) can outperform real university students in exams. Researchers created 33 fictitious students and used the AI tool ChatGPT to generate answers for undergraduate psychology exams. The AI students' results were on average half a grade higher than those of real students, with 94% of AI essays going undetected by markers. The study, published in Plos One, highlighted concerns that AI could enable students to cheat and achieve better grades. While the detection rate was 6%, researchers believe it may be an overestimate. The study's authors, Associate Prof Peter Scarfe and Prof Etienne Roesch, emphasized the need for educators to address the impact of AI on educational assessments. The research indicates a potential shift in the global education sector towards adapting to AI's influence, despite challenges in abstract reasoning faced by current AI systems.

Lessons About the Human Mind from Artificial Intelligence

In 2022, a Google engineer claimed AI chatbot LaMDA was self-aware, but further scrutiny revealed it mimicked human-like responses without true understanding. This incident underscores AI limitations in comprehension and originality.

ChatGPT is biased against resumes with credentials that imply a disability

Researchers at the University of Washington found bias in ChatGPT, an AI tool for resume ranking, against disability-related credentials. Customizing the tool reduced bias, emphasizing the importance of addressing biases in AI systems for fair outcomes.

The Encyclopedia Project, or How to Know in the Age of AI

Artificial intelligence challenges information reliability online, blurring real and fake content. An anecdote underscores the necessity of trustworthy sources like encyclopedias. The piece advocates for critical thinking amid AI-driven misinformation.

The Death of the Junior Developer – Steve Yegge

The blog discusses AI models like ChatGPT impacting junior developers in law, writing, editing, and programming. Senior professionals benefit from AI assistants like GPT-4o, Gemini, and Claude 3 Opus, enhancing efficiency and productivity in Chat Oriented Programming (CHOP).

Why We're Deeply Invested in Making AI Better at Math Tutoring

Khan Academy is advancing AI for math tutoring with Khanmigo, aiming to mimic human tutors. Despite some errors, efforts continue to improve tutoring with tools like calculators, GPT-4 Turbo, and GPT-4o models. They prioritize enhancing AI's tutoring capabilities and sharing insights with the education community.

9 comments

By @bee_rider - 10 months

Worrying that AI might make exams obsolete is kind of odd, I mean, it is a symptom I guess but only at the very end of a long stupid cascading failure.

Students cheat because they want the degree but don’t care to learn the material. Or maybe they want to learn the material, but see employment at the end as requiring better grades than they can get naturally. Either one is the result of bullshit credentialism. (Bullshit credentialism probably comes in part as a result of bullshit jobs where work-product can’t be evaluated because it’s all useless).

Hopefully students manage to cheat on so many tests that grades can become completely useless for employers. Then, they can become something useful for the students, a way to evaluate their progress and get feedback.

By @quantum_state - 10 months

This says more about the exams than the AI or the students…

By @a_bonobo - 10 months

This fits well with what we know about AI and Bloom's Taxonomy of Learning, which goes from 'remembering' on the lowest step to 'creating' on the top step (remember, understand, apply, analyse, evaluate, create).

Undergrad exams are usually somewhere around 'remembering', simple fact or definition regurgitation. Most of these facts should be in chatGPT's training data. As the degree proceeds things get harder and we move up the taxonomy, and that's where we know LLMs fail: there's nothing in there that can really 'understand', let alone 'create'.

By @Yawrehto - 10 months

Lesson learned, don't go into psychology.

By @threecheese - 10 months

This is just a way more reactionary way to communicate model benchmarks.

“A computer algorithm performed better than humans on a task it was designed for” sounds like the last forty years in a nutshell.

By @RecycledEle - 10 months

Yes, the current crop of world knowledge AIs are smarter than any human who ever lived.

And big names are calling them useless.

This is proof the human race is not generally capable to solving novel problems, so I hope people will stop expecting AIs to solve every novel problem.

By @paxys - 10 months

Next you will tell me that a calculator can beat students at addition and subtraction.

By @meristohm - 10 months

Can "educated machines" reproduce on their own yet? How do they fit into the food web, the carbon cycle, the nitrogen cycle, etc? Can they make meaning, towards a purpose in life? What role do they serve other than human ingenuity ego-stroking and for a few to further extract money from the many?

Can AI love, yet?

To what degree are we just avoiding dealing with existential threats by churning through resources to play god and make robots in our image? (Albeit a subset of humanity, and not without bias)

I'm not yet convinced this AI work isn't a waste of time and other resources. I'd far rather we put our efforts into land/water stewardship and a "new" vision for human existence based on many of the old ways that got us this far, so that we might go another several hundred thousand years.

In an unbroken oral tradition, what stories might those future people tell about this time?

By @squircle - 10 months

Ohno.

AI can beat real university students in exams, study suggests

Related

Lessons About the Human Mind from Artificial Intelligence

ChatGPT is biased against resumes with credentials that imply a disability

The Encyclopedia Project, or How to Know in the Age of AI

The Death of the Junior Developer – Steve Yegge

Why We're Deeply Invested in Making AI Better at Math Tutoring

Related

Lessons About the Human Mind from Artificial Intelligence

ChatGPT is biased against resumes with credentials that imply a disability

The Encyclopedia Project, or How to Know in the Age of AI

The Death of the Junior Developer – Steve Yegge

Why We're Deeply Invested in Making AI Better at Math Tutoring