i am writing a device driver on linux-2.6.26. I want to have a dma buffer mapped into userspace for sending data from driver to userspace application. Please suggest some good tutorial on it.
Thanks
Here is what I have used, in brief...
get_user_pages
to pin the user page(s) and give you an array of struct page *
pointers.
dma_map_page
on each struct page *
to get the DMA address (aka. "I/O address") for the page. This also creates an IOMMU mapping (if needed on your platform).
Now tell your device to perform the DMA into the memory using those DMA addresses. Obviously they can be non-contiguous; memory is only guaranteed to be contiguous in multiples of the page size.
dma_sync_single_for_cpu
to do any necessary cache flushes or bounce buffer blitting or whatever. This call guarantees that the CPU can actually see the result of the DMA, since on many systems, modifying physical RAM behind the CPU's back results in stale caches.
dma_unmap_page
to free the IOMMU mapping (if it was needed on your platform).
put_page
to un-pin the user page(s).
Note that you must check for errors all the way through here, because there are limited resources all over the place. get_user_pages
returns a negative number for an outright error (-errno), but it can return a positive number to tell you how many pages it actually managed to pin (physical memory is not limitless). If this is less than you requested, you still must loop through all of the pages it did pin in order to call put_page
on them. (Otherwise you are leaking kernel memory; very bad.)
dma_map_page
can also return an error (-errno), because IOMMU mappings are another limited resource.
dma_unmap_page
and put_page
return void
, as usual for Linux "freeing" functions. (Linux kernel resource management routines only return errors because something actually went wrong, not because you screwed up and passed a bad pointer or something. The basic assumption is that you are never screwing up because this is kernel code. Although get_user_pages
does check to ensure the validity of the user addresses and will return an error if the user handed you a bad pointer.)
You can also consider using the _sg functions if you want a friendly interface to scatter/gather. Then you would call dma_map_sg
instead of dma_map_page
, dma_sync_sg_for_cpu
instead of dma_sync_single_for_cpu
, etc.
Also note that many of these functions may be more-or-less no-ops on your platform, so you can often get away with being sloppy. (In particular, dma_sync_... and dma_unmap_... do nothing on my x86_64 system.) But on those platforms, the calls themselves get compiled into nothing, so there is no excuse for being sloppy.