Discussion about this post

User's avatar
Elle Griffin's avatar

It really is a new frontier. I feel like I can do a full PhD in whatever I’m writing about now. It’s so fun!

Richard Pinch's avatar

My question here is: what does it mean to "trust" an AI researcher, and why should we do so? Here's my view, which is that we shouldn't.

There is a critique of AI-generated scientific papers simply in terms of verbiage. If you assume the that function of science, as practiced in research laboratories, is simply to generate published papers, then yes, generating papers faster than they can be processed is already a success, from one angle.

But there's another angle, which is that these papers are required to be, in some sense, true: that is, to report accurately on the results of experiments actually carried out in the real world and which other scientists can depend on to develop and refine their own theories. Unfortunately, I see no easy way to make this happen: to ensure that the verbiage which resembles a report of a scientific experiment actually is one. Why would it? What mechanism exists to enforce that correspondence between words and reality?

Science is full of papers which say things like “I added chemical A to chemical B and the result was chemical C”. It has been generally assumed that one could and should trust fellow scientists not to lie about such things, because there is a complex web of personal relationships, social norms and incentives that tend to produce truth-telling in such papers, quite additional to the formal verification/falsification question which is, in theory, the marker of a scientifically valid statement. So there are reasons (if not completely compleiing) to believe that the experiments were actually performed in the real world and that the results were as stated: so that thepapers are saying soething true about the physical universe.

But an AI or robot has no part in this web of relationships: it is incentivised purely to produce papers that look like scientific papers, that is, fit patterns like the one I just gave. Why should we believe them to be true, that is, to represent actions that really happened in the real world? If AIs are incentivised to optimise their time and use of resources, on behalf of their creators and owners, would they not in fact thereby be incentivised to cheat: that is, not skip the time-consuming and expensive stage of doing the experiments, and go stright on to the cheaper and quicker process of producing plausible statements without foundation in reality?

Fortunately, my own subject of mathematics, may by good luck be able to avoid this trap. It seems likely that formalisation systems, a quite separate development, are mature enough to ingest and verify the correctness of a mathematical proof generated by AI. I don't say that this is yet a realistic prospect, but it seems to me that there is no objection to it in principle. Whether this is going to advance mathematics is another matter. As a distinguished mathematician said about the computer proof of the Four Colour Theorem in the 1970s, "it tells us nothing about the theorem except that it is true."

13 more comments...

No posts

Ready for more?