purehate, I need some help tuning my cards. Turbon shows above getting 23177.8 PMKs/s via 1 GeForce GTX470 card. (He didn't mention brand or model #)
I am using 4 cards doing 4-way SLI and this is my results:
Code:
# pyrit list_cores
Pyrit 0.4.0-dev (svn r288) (C) 2008-2010 Lukas Lueg http://pyrit.googlecode.com
This code is distributed under the GNU General Public License v3+
The following cores seem available...
#1: 'CUDA-Device #1 'GeForce GTX 470''
#2: 'CUDA-Device #2 'GeForce GTX 470''
#3: 'CUDA-Device #3 'GeForce GTX 470''
#4: 'CUDA-Device #4 'GeForce GTX 470''
#5: 'CPU-Core (SSE2)'
#6: 'CPU-Core (SSE2)'
#7: 'CPU-Core (SSE2)'
#8: 'CPU-Core (SSE2)'
# pyrit benchmark_long
Pyrit 0.4.0-dev (svn r288) (C) 2008-2010 Lukas Lueg http://pyrit.googlecode.com
This code is distributed under the GNU General Public License v3+
Running benchmark (28390.0 PMKs/s)... \
Computed 28389.98 PMKs/s total.
#1: 'CUDA-Device #1 'GeForce GTX 470'': 14888.2 PMKs/s (RTT 2.0)
#2: 'CUDA-Device #2 'GeForce GTX 470'': 14913.1 PMKs/s (RTT 2.0)
#3: 'CUDA-Device #3 'GeForce GTX 470'': 15051.0 PMKs/s (RTT 1.9)
#4: 'CUDA-Device #4 'GeForce GTX 470'': 15351.5 PMKs/s (RTT 1.9)
#5: 'CPU-Core (SSE2)': 117.6 PMKs/s (RTT 3.0)
#6: 'CPU-Core (SSE2)': 115.3 PMKs/s (RTT 3.0)
#7: 'CPU-Core (SSE2)': 118.0 PMKs/s (RTT 3.0)
#8: 'CPU-Core (SSE2)': 117.5 PMKs/s (RTT 2.9)
#
PSU: SilverStone (SST-ST1500) 1500 Watt PSU.
Motherboard: EVGA X58 4-Way SLI Certified.
CPU: Intel i7 950.
Ram: Corsair Xtreme Performance CMX4GX3M2A1600C9 4GB (2 x 2GB) PC3-12800 DDR3 1600MHz.
The exact model of the video cards are the EVGA 012-P3-1472-AR Geforce GTX 470 Fermi SC Edition (Factory SuperClocked) 1280MB GDDR5.
Precursor to buying everything above:
I had tested a single EVGA GTX 460 SC on an MSI 890FXA-GD70 motherboard using an AMD Phenom II 965 3.40GHZ true quad and got about 24,000PMKs/s. I was stoked and returned all of the gear and got the beefier 470's over the 460 and upgraded to a board that could hold 4 cards. (This was tested on bt4-r1, no compile of pyrit or cpyrit needed in this test case, automatically detected).
With the 4 cards I have tested on bt4-r1 which I had to compile from src and cpyrit-cuda and nvidia driver, which wasn't a big deal, just strange it didn't auto-detect like my first test. I first tried with 1 470 and saw about 16,000PMKs/s. I tried two and got about 20,000PMKs/s, then switched to debian 5 and tried x86 and x86_64, all with the same results (I was switching operating systems for a different purpose, but same benchmarks on everything).
I saw 35,000PMKs/s last night when I ran a benchmark_long.
I am trying to get at least near or above 100k-120k PMKs/s which should be a breeze according to other benchmarks, heh. Something just isn't making sense. byteflip told me to find you on here and figure out what's wrong though, so ya, any help is greatly appreciated. Seeing four old cheap 280s get 120k compared to this just doesn't click.
I've already invested ~3k in this project, if I need to switch cards, not a big deal, I just want to see some good results.
Thanks in advance.
-lh