Search

Blog Entry
Do you compile your application without compiler optimization? You need to read this!

Originally posted by: JimMcInnes I got curious about what the real speedup is from using -O2, so I did a few runs on an old machine we have sitting around in our shop. I used XLC V11.1 and XLF V13.1 to measure the old SPEC CPU2000 benchmark suite. I did two measurements: one with ...

Blog Entry
New OpenMP features

Originally posted by: StevenPerron OpenMP is an add-on to the C, C++, and FORTRAN programming languages that is meant to give the programmer an easy and portable way to parallelize their programs. This is done by adding directives (or pragmas), run-time routines, and environment...

Blog Entry
Assist Threads Part 1

Originally posted by: stan kvasov Today’s multi-core processors support many threads of execution and can provide substantial performance when running multithreaded applications. Unfortunately, multithreaded programming is difficult, and as a result, a lot of today’s software is still single...

Blog Entry
The View from IWOMP 2010 (Trip Report)

Originally posted by: Michael Wong This week was the 2010 International Workshop on OpenMP (IWOMP) in Tsukuba Science City, where we meet annually to showcase the latest research in parallel computing and OpenMP. There is also an OpenMP language committee meeting immediately following...

Blog Entry
POWER7 SPEC CPU2006 featuring automatic parallelization in XLC

Originally posted by: rauls With the recent publication of the SPEC CPU2006 scores of the POWER7-based p780 server, the IBM Power Systems have regained leadership on both the SPECint 2006 and SPECfp 2006 components of this industry benchmark suite. In particular, the peak FP score of 71.5...

Blog Entry
BoostCon 2010 Trip Report

Originally posted by: Michael Wong Hi, all. I came back from BoostCon2010: http://www.boostcon.com/program#schedule where I delivered three talks and participated in a panel discussion on Transactional Memory, along with such luminaries as Maurice Herlihy (the father of TM), Mark Moir (Sun...

Blog Entry
MASS v6.0 -- High performance math libraries for POWER7

Originally posted by: Robert Enenkel Mathematical Acceleration Subsystem (MASS) Version 6.0 for AIX has been released. This high performance suite of elementary and special mathematical functions contains two new libraries tuned for the IBM POWER7 processor, and exploiting the POWER7 VSX...

Blog Entry
MASS auto-vectorization paper now on developerWorks

Originally posted by: Robert Enenkel If your applications call mathematical functions such as sin, cos, exp, log, etc. and you are interested in maximizing performance with minimum effort, here is something that will interest you! My colleague Daniel Zabawa and I have written a paper, "How...

Blog Entry
XL compilers and Parallel Environment

Originally posted by: AbC A variety of tools work with the IBM XL compilers. Some help productivity in the development phase (IBM debugger, RDp), some help exploit the architecture characteristics (compiler report) and some help utilize the hardware. The IBM Parallel...

Blog Entry
The pre-birth of the C++0x Standard.

Originally posted by: Michael Wong As of March 26, a Final Committee Draft (FCD) has been sent to the ISO office. This contains all the updates from the last meeting in March, and addresses all the issues from the comments from CD1 from 2008. This will now be released for public comment until...
