You can usually even ask the same LLM: - do a task - criticize your job on that ...

roughly · 2024-10-13T16:14:29 1728836069

What’s fun is that you can skip step 1. The LLM will happily critique its own nonexistent output.

zmgsabst · 2024-10-14T21:23:19 1728940999

So?

I too can write made up criticism if that’s what my boss wants in the workplace — but that doesn’t suddenly invalidate my ability to criticize my own work to improve it.

iSnow · 2024-10-13T17:13:52 1728839632

That's a smart idea I didn't think of.

I've been arguing with Copilot back and forth where it gave me a half-working solution that seemed overly complicated but since I was new to the tech used, I couldn't say what exactly was wrong. After a couple of hours, I googled the background and trust my instinct and was able to simplify the code.

At that situation, where I iteratively improved the solution by telling Copilot things seem to complicated and this or that isn't working. That led the LLM to actually come back with better ideas. I kept asking myself why something like you propose isn't baked into the system.

drawnwren · 2024-10-13T17:57:44 1728842264

The papers I've read have shown LLM critics to be quite bad at their work. If you give an LLM a few known good and bad results, I think you'll see the LLM is just as likely to make good results bad as it is to make bad results good.

blazing234 · 2024-10-13T15:29:58 1728833398

How do you know the second result is correct? Or the third? Or the fourth?

phil-martin · 2024-10-13T17:12:30 1728839550

I approach it the same way as the things I build myself - testing and measuring.

Although if I’m truly honest with myself, even after many years of developing, the true cycle of me writing code is: over confidence, then shock it didn’t work 100% the first time, wondering if there is a bug in the compiler, and then reality setting in that of course the compiler is fine and I just made my 15th off-by-one error of the day :)