I have been trying to use "-fprofile-generate" and "-fprofile-use" on some small bitwise C benchmarks (developed at MIT). I am using a checkout of GCC 4.0 from October 2004.
Try a checkout from today and let us know if the problem is still there. If it is, please file a PR on our web site. Thanks.