Memory copy without cache involved

icoming · Mar 19, 2011

Hello,

I'm searching for the way to do memory copy without cache involved. I want to copy data from one location of physical memory to another location of physical memory, i.e., I want real memory copy instead of virtual memory mapping. Does Intel processor provide instructions to do that? In some case the program just needs to do memory copy and the data isn't going to be used in the future, so there is no reason to copy it to the cache and pollute cache.

Thanks,
Da

Zenthar · Mar 19, 2011

I haven't got this low level in a while, but that kind of think might also be processor/architecture specific.

I found that, hope it can be a starting point.

icoming · Mar 19, 2011

I agree that this is architecture specific. That's why I ask if Intel processors provide such instructions to do that. It's very likely only some of Intel processors can do it.

Ijack · Mar 19, 2011

Yes. Just set up entries in the Page Tables pointing to the locations in question and then use any of the MOVS instructions to do the move. Of course, you'll have to be running the processor at the highest privilege level - i.e. kernel code.

icoming · Mar 21, 2011

Do I need to set up the entries in the page table? Isn't it done automatically when the memory is allocated and accessed?
Why does the processor need to be at the privilege level?

Ijack · Mar 21, 2011

Because you are asking to move memory between physical memory locations rather than virtual ones. Only the kernel of the Operating System, running at the highest level, can access physical memory directly. User programs can only access it indirectly via the paging mechanism - that's what virtual memory mapping is.

To access the physical memory directly you need the appropriate entries in the Page Table pointing to that memory. Without manipulating the Page Tables you don't know what physical memory a virtual memory address is pointing at. Obviously only privileged code is allowed to manipulate those tables. If user programs could manipulate physical memory directly it would be possible to bypass security restrictions and it would make the Operating System unstable.

icoming · Mar 21, 2011

oh, sorry, then I didn't get my question clear. What I meant is that I need to do real memory copy instead of mapping two virtual memory addresses to the same physical memory. I don't need to address any specific physical memory addresses. The memory is still addressed with the virtual memory address.

Ijack · Mar 21, 2011

You've lost me now. I'm not sure what you are talking about.

If you want to do a simple memory copy then just use one of the MOVS instructions.

icoming · Mar 22, 2011

What I really care is how to copy data without loading data to cache. I don't want cache to be polluted.

MOVS instructions will load data to the cache, right? I'm currently using memcpy to do memory copy, and it seems memcpy in GNU C library for x64 does use MOVS to copy data, and VTune shows me that data is loaded to cache when memcpy is used.

Ijack · Mar 22, 2011

Ah, that's a different problem. You can use the MOVNTI instruction in conjunction with the SFENCE instruction to avoid polluting the data cache. But you have to balance that with the fact that these instructions are not as efficient as the MOVS ones. It would be an interesting experiment but I would guess that this inefficiency would outweigh any savings made by not having to reload data into the data cache. Bear in mind that the various pipelines in the processor operate in parallel, so the penalty of having to reload the cache may not be as significant as you think.

I'm fairly sure that the guys who wrote the GNU Standard C library have thought of these things. But only benchmarking tests could tell. Try writing a routine using these instructions to see if you can significantly improve on the library routine.

icoming · Mar 22, 2011

I'm working on Atom processors, which uses in-order architecture. It runs at a fairly high frequency, but the memory bus is slow, so I'm thinking maybe I can improve the performance by avoiding polluting cache.
Thanks, I'll try.

icoming · Mar 22, 2011

Best answer selected by icoming.

Search

Memory copy without cache involved

icoming

Distinguished

Ijack

Zenthar

Distinguished

icoming

Distinguished

Ijack

Distinguished

icoming

Distinguished

Ijack

Distinguished

icoming

Distinguished

Ijack

Distinguished

icoming

Distinguished

Ijack

Distinguished

icoming

Distinguished

icoming

Distinguished

Similar threads

TRENDING THREADS

Share this page