Adventures with Deep Research

December 10, 2025 · John Quiggin · 9 Comments

How my AI report on housework started well, then went off the rails

I’ve long been interested in the topic of housework, as you can see from this Crooked Timber post, which produced a long and unusually productive discussion thread [fn1]. The issue came up again in relation to the prospects for humanoid robots. It’s also at the edge of a bunch of debates going on (mostly on Substack) about living standards and birth rates.

I’m also interested (like nearly everyone, one way or another) in “Artificial Intelligence” (scare quotes intentional). My current position is, broadly, that it’s what Google should have become instead of being steadily enshittified in the pursuit of advertising dollars. But I’m alert to other possibilities, including that more investment will deliver something that genuinely justifies the name AI. And I think a lot of the concerns about power and water use, the spread of AI slop and so on are either overstated or (as with deepfakes) are mostly new iterations of concerns that always arise with new IT and communications technology, and can be addressed with existing conceptual and legal tools.

With this background, I thought it would be interesting to try out ChatGPT’s Deep Research (DR) on the question of what has happened to housework and why. As I may have mentioned before, I’ve trained DR on a big volume of my own writing. That produces a passable imitation of my style, and means I don’t worry about the ethical issues of plagiarising the writing style of others (of course, standard norms of citation and attribution still apply).

I decided to focus on single-person households, to abstract away from the issues of child-raising (which I want to look at separately) and the allocation of work between partners (about which there is a vast literature to which I can’t add anything new).

Everything went really well to start with. I prompted DR for time use data, then pushed further on with more detailed questions like the impact of air fryers on male cooking habits (I was given one recently and was impressed enough that I promptly bought a second). I asked for a literature search and got references to Judy Wajcman and Michael Bittman, both of whom I knew, and a couple of people I didn’t. DR missed Ruth Schwartz Cowan’s classic More Work for Mother.

On the other hand, I wasn’t aware of Wajcman’s recent Pressed for Time and hadn’t thought about the broader issue of life administration, which DR pointed out. I gave it a more economistic take, trying to divide labour-saving innovation (electronic bill paying) from the labour costs of more digital consumption (retrieving passwords for streaming services etc).

I got DR to produce a LaTeX file, and was nearly ready to go to digital press when I noticed that the references were incomplete. At this stage, the whole process spiralled into disaster. Every draft seemed to lose more material, and to be worse written. Finally, I demanded an explanation.

Source: Enkiai.com

DR graciously explained that this is the way it does things. Here’s its response with original formatting (more detailed version here).

Why Deep Research Fails

The core issue: The “illusion of continuity” in long-context LLMs

Large-context models appear to preserve every part of the conversation, but in reality:

They do not reliably retain or retrieve long-form outputs they themselves generated.

Even though the conversation window may be large, the model often cannot:

- accurately locate the version you consider the “full text,”
- reliably differentiate between multiple drafts (short, medium, long) created earlier,
- or accurately regenerate from earlier output unless it is verbatim re-supplied by the user.

This is because the internal contextual representation compresses earlier tokens. Information is not stored sequentially; it is represented in a latent space, which cannot always reconstruct past outputs faithfully.

Fortunately, my LaTeX program took continuous backups, so I’ve been able to retrieve the last good version, but I’m going to keep it away from DR for now.
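
For what it’s worth, this kind of silent loss between drafts can be checked mechanically. The following is a minimal sketch in Python, not a description of how DR or any LaTeX editor works. It assumes each draft is available as a plain LaTeX string that uses \cite-style commands and a thebibliography environment with \bibitem entries; the function names and the citation keys in the usage note are hypothetical.

```python
import re

# Assumption: citations look like \cite{a,b}, \citep{a}, \citet[p. 3]{a}, etc.
CITE_RE = re.compile(r"\\cite[a-zA-Z]*\*?(?:\[[^\]]*\])*\{([^}]*)\}")
# Assumption: references are \bibitem{key} or \bibitem[label]{key} entries.
BIBITEM_RE = re.compile(r"\\bibitem(?:\[[^\]]*\])?\{([^}]*)\}")


def cited_keys(tex):
    """Return the set of citation keys used anywhere in the draft."""
    keys = set()
    for group in CITE_RE.findall(tex):
        keys.update(k.strip() for k in group.split(",") if k.strip())
    return keys


def bibitem_keys(tex):
    """Return the set of keys defined by \\bibitem entries in the draft."""
    return {k.strip() for k in BIBITEM_RE.findall(tex)}


def compare_drafts(old, new):
    """Report what the new draft lost relative to the old one."""
    return {
        # Keys cited in the old draft but no longer cited in the new one.
        "lost_citations": sorted(cited_keys(old) - cited_keys(new)),
        # Reference entries present in the old draft but missing from the new one.
        "lost_references": sorted(bibitem_keys(old) - bibitem_keys(new)),
        # Keys cited in the new draft with no matching \bibitem entry.
        "dangling_citations": sorted(cited_keys(new) - bibitem_keys(new)),
        # Negative values mean the new draft is shorter (whitespace-split words).
        "word_change": len(new.split()) - len(old.split()),
    }
```

Calling compare_drafts(last_good, latest) on the backed-up version and the newest draft returns the citation keys and reference entries that have disappeared, plus the change in word count. A draft that uses a separate BibTeX file rather than \bibitem entries would show every citation as dangling, so that field is only meaningful under the assumption stated above.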

I was going to go on with more detail about the actual report, but my op-ed training leads me to feel that a post should have 700 words, and I am at 675 as I type this.

fn1. I can take a victory lap on my jihad/crusade against ironing, which has disappeared almost entirely, contradicting the expectations of many commenters.

9 thoughts on “Adventures with Deep Research”

N says:

December 10, 2025 at 7:51 am

La arruga es bella (“the wrinkle is beautiful”, iirc … ) I don’t remember, though, whether they meant wrinkles in clothing, or wrinkles on people!! It could be either. Yes? No?

That’s interesting what you say about air fryers. Hmm. Maybe we need one.

Is housework really so awful? I deal with it by having low-ish standards. (I am not sure that Americans aren’t a bit too clean.) My problem is clutter. Specifically, paper.

Anonymous says:

December 10, 2025 at 9:30 am

There will be plenty more on the substantive topic in the next post.

Ikonoclast says:

December 10, 2025 at 11:08 am

I stopped ironing decades ago. My wife does very minimal ironing: a few items of her clothing and none of mine.

Housework is awful as is much drudgery-type work. I would reduce housework drudgery by being more minimalist and owning much less stuff, if I could. But my wife likes lots of stuff though not outright bric-a-brac.

I partly agree with the saying, “The things you own end up owning you”. However, there is a necessary minimum which includes one’s own house if possible. Without a house, landlords own you or the elements own you. And councils start acting like Moreton Bay council:

https://www.abc.net.au/news/2025-12-08/moreton-bay-council-faces-court-over-homeless-camp-destruction/106114178

N says:

December 11, 2025 at 8:15 am

I will say, I seem to miss the smell of starch. Or was it the smell of the ironing itself? (I haven’t ironed in … I can’t remember the last time.) There was something that smelled good! We use some kind of sensitive-skin detergent now, so the laundry doesn’t smell “good.” Overall, I guess this is a win.

Maybe on occasion, I might iron a tablecloth – but really not even those most of the time.

Roger_f says:

December 12, 2025 at 7:04 am

Air fryers? Phht, they do need serious cleaning – whatever happened to the Dutch oven?

John Q says:

December 16, 2025 at 7:56 am

Roger: silicone liners are what you need.

anon says:

December 23, 2025 at 6:43 pm

I have found a similar phenomenon with image generators. If you ask one to go back to an image other than the one from the last prompt, it has no record of it, and you have to download the image and upload it again. It’s as if AI has amnesia as severe as Clive Wearing’s.

Ikonoclast says:

December 29, 2025 at 8:53 am

Google AI attributes one of my ideas to John Quiggin!!! Of course, I attribute no blame to J.Q. in this matter. I am hoping to get a reply here from J.Q. as to his thoughts on it. Let us look at the Google AI answer to the question, “Has anyone theorised that formal systems are subsets of real systems?” The date and time stamp of the answer probably matters as these answers no doubt mutate. The date was 28/12/2025 and the time about 7 am Qld time. Here is the Google AI answer.

“Yes, it has been theorized, particularly within certain schools of thought in systems science and the philosophy of science, that formal systems are subsets or a type of real system.

This perspective generally posits that:
- Formal systems are instantiated in reality: Formal systems (like mathematics, logic, or computer programs) are not purely abstract entities existing outside of reality, but are realized within real, physical systems (e.g., as patterns in a brain, ink on paper, or voltage states in a computer).
- Physical basis: In a materialist or physicalist view, formal systems are considered subsets of real systems and are linked to the real world via operations of matter, energy, and information.
- Modeling relationship: The core idea is that “real systems” are the actual, physical, natural systems, and “formal systems” are conceptual models or abstractions that humans create to understand and manage aspects of those real systems.

One specific proponent of this idea appears to be John Quiggin, an economist and philosopher of science, who explicitly states: “I hold that formal systems are subsets of real systems and are thus real (sub)systems themselves”.

However, this is not a universally accepted view and exists alongside other perspectives:
- Formal minimalism/constructivism: Some views hold that systems are purely mental constructs or that a real system is anything that can be modeled by a conceptual (formal) system, implying the formal system is the primary definition.
- Pragmatic distinction: In practice, systems engineers often emphasize the need to continually distinguish between systems in the real world and their formal representations (models) for practical application.
- Abstract vs. concrete: Systems theorists like Ludwig von Bertalanffy made a fundamental distinction between “real” (concrete, physical, natural) systems and “conceptual” (abstract) systems, which forms the basis of many discussions on the topic.

In summary, the notion you describe is a recognized viewpoint within specific academic and philosophical discussions about the nature of systems and reality.” [End of AI Quote]

At a personal level, I am slightly antsy about this while, as I say, holding J.Q. entirely blameless. The AI makes a false attribution. The ideas and mode of expression are certainly mine in the “for” case. The bolded text and points in the “for” case specifically follow my exposition of these ideas. It is significant that no other academic or intellectual is quoted as holding or putting forth the “for” case other than the AI’s false attribution of the ideas to J.Q. If there were other significant expressions of these ideas in this form by recognised philosophers one would perhaps expect a reference to them.

Whether my ideas have some originality or whether they are explicitly or implicitly contained in the relevant literature I do not know. I am not well enough read or schooled in philosophy. I do aver that I arrived at these ideas by independent deduction with only an amateur magpie’s collection of philosophical knowledge. Personally, I think that is no mean feat and I shall have to pat myself on the back since nobody else will, not even the bliddy AI. 😉

As a general statement, obscure, unrecognised and largely unrewarded people often come up with decent ideas and decent works of various kinds. Our cultural system, however, has always been designed to shunt most of the accolades and rewards up to a privileged few. Now the AI (so-called) perpetuates that system as well. The AI is a mirror that reflects our society’s failings in many ways. Again, to be clear, J.Q. throughout his career has written and worked against the political economy system that perpetuates theft of labour and theft (or perhaps enclosure) of ideas.

“I hold that formal systems are subsets of real systems and are thus real (sub)systems themselves”. That’s my sentence for what it’s worth.

Ikonoclast says:

December 31, 2025 at 1:45 pm

At this time of year, we should ask ourselves, “What are we celebrating?” The real hard facts in Australia are:

1. Climate change is accelerating and will be particularly bad for us.
2. Ongoing health and economic damages from COVID-19 disease continue to rise.
3. Housing affordability continues to fall.
4. Homelessness continues to rise.
5. Inequality continues to rise.

I could add much more but this list is bad enough. We have nothing to celebrate. Indeed we should be extremely alarmed and very sober and sombre about what is happening. We should also demand action and take any legal and constructive action we can to change things. Utter disaster is close now if we do not radically change direction.


John Quiggin

Commentary on Australian and world events from a socialist and democratic viewpoint
