Tindytim
Distinguished
[citation][nom]afrobacon[/nom]whats the fastest card Intel has out right now? How does it compare to Nvidia/ATI?[/citation]
Intel doesn't have a single card out. Just a bunch of integrated solutions. None of which compete with any serious solutions, but none of them were meant to.
Although I found something rather interesting when I did a bit of research into the matter:
I'd like to have faith in Intel, considering their great success with their SSDs on their first try. But that was a relatively new technology, that doesn't have a large market for it. So I can only hope it does well.
Intel doesn't have a single card out. Just a bunch of integrated solutions. None of which compete with any serious solutions, but none of them were meant to.
Although I found something rather interesting when I did a bit of research into the matter:
http/en.wikipedia.org/wiki/Larrabee_(GPU)Comparison with the Cell Broadband Engine
Larrabee's philosophy of using many small, simple cores is similar to the ideas behind the Cell processor. There are some further commonalities, such as the use of a high-bandwidth ring bus to communicate between cores. However, there are many significant differences in implementation which should make programming Larrabee simpler.
* The Cell processor includes one main processor which controls many smaller processors. Additionally, the main processor can run an operating system. In contrast, all of Larrabee's cores are the same, and the Larrabee is not expected to run an OS.
* Each compute core in the Cell (SPE) has a local store, for which explicit (DMA) operations are used for all accesses to DRAM. Ordinary reads/writes to DRAM are not allowed. In Larrabee, all on-chip and off-chip memories are under automatically-managed coherent cache hierarchy, so that its cores virtually share a uniform memory space through standard load/store instructions.
* Because of the cache coherency noted above, each program running in Larrabee has virtually a large linear memory just as in traditional general-purpose CPU; whereas an application for Cell should be programmed taking into consideration limited memory footprint of the local store associated with each SPE (for details see this article) but with theoretically higher bandwidth.
* Cell uses DMA for data transfer to/from on-chip local memories, which has a merit in flexibility and throughput; whereas Larrabee uses special instructions for cache manipulation (notably cache eviction hints and pre-fetch instructions), which has a merit in that it can maintain cache coherence (hence the standard memory hierarchy) while boosting performance for e.g. rendering pipelines and other stream-like computation.
* Each compute core in the Cell runs only one thread at a time, in-order. A core in Larrabee runs up to four threads. Larrabee's hyperthreading helps hide latencies and compensates for lack of out-of-order execution.
I'd like to have faith in Intel, considering their great success with their SSDs on their first try. But that was a relatively new technology, that doesn't have a large market for it. So I can only hope it does well.