Hacker News new | past | comments | ask | show | jobs | submit login

So he's essentially using the OS's disk cache as his cache, and tracking the disk files substituting for manual memory allocation.

In this situation on .NET, I wrote a resource manager that handed out handles to large byte arrays (in particular, ones that were too big to be collected outside of gen2, in the large object heap, which kicked in at 80kb at the time IIRC). The handles implemented IDisposable so taking care of handing back the byte array was no more or less tedious than any other resource you need to manage explicitly. The resource manager kept a hold of the arrays internally using a weak pointer so they could still be collected when gen2 collections actually happened, but allocating the buffers themselves would never cause gen2 collections in a steady state.

To turn that into a cache, you'd need another layer with keys, an eviction policy and an invalidation mechanism. I think it ought still be better than round-tripping to disk.

I wrote a different version of the resource manager that used P/Invoke helpers and unsafe code to allocate from unmanaged memory directly, but it didn't perform any better - it didn't relieve any pressure on the GC, which was 2% of CPU usage at full load in any case.




> So he's essentially using the OS's disk cache as his cache, and tracking the disk files substituting for manual memory allocation.

And that's often not the worst idea, because, when done correctly, this stuff never hits the disk when enough RAM is around.

Plus, life cycle management is done by the OS, not by you. It also tends to work better than "not at all" on memory pressure or if there isn't a lot of memory in the first place.


If you write a file to disk, then wait fifteen minutes before deleting it, chances are the OS will have flushed it to disk sometime in the meantime. It won't have to re-read it if there's still free memory, but it's extra load on disk io.

If you write a lot of them, then even if you have the memory, you may overrun the size limit of the write buffer and cause application stalls.

Writing to a ramdisk (e.g. tmpfs) is always an option, though.


Every OS has flags for this. Windows' CreateFile has a _TEMPORARY flag and most unixes have something like O_TMPFILE.


It'll be flushed to disk within seconds.

I agree in a memory-rich environment it's far from the worst idea. However now you need to manage the files. You've just pushed the problem somewhere else.


Every OS has flags for this. Windows' CreateFile has a _TEMPORARY flag and most unixes have something like O_TMPFILE. These avoids flushing them to disk, even on a background writer timeout (Linux/Windows). Further, on many *nix systems /tmp and the like are often in-memory FSes.


.net doesn't expose it.


That sucks :(




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: