Subzero: Add fabs intrinsic support.

The intrinsic is lowered using the standard technique of masking off the FP sign bit, which is the high-order bit.

To construct this mask, we use the existing trick of loading a vector register with all "1" bits, then logical-shift-right by one bit.

In the future, we should add 128-bit vector values to the constant pool and force them to memory, and this could be used for the other routines that synthesize a vector constant.

BUG= https://code.google.com/p/nativeclient/issues/detail?id=4097
R=jvoung@chromium.org

Review URL: https://codereview.chromium.org/1022573004
diff --git a/crosstest/crosstest.cfg b/crosstest/crosstest.cfg
index edb06ea..2222b90 100644
--- a/crosstest/crosstest.cfg
+++ b/crosstest/crosstest.cfg
@@ -8,7 +8,7 @@
 
 [test_arith]
 driver: test_arith_main.cpp
-test: test_arith.cpp test_arith_frem.ll test_arith_sqrt.ll
+test: test_arith.cpp test_arith_frem.ll test_arith_sqrt.ll test_arith_fabs.ll
 
 [test_bitmanip]
 driver: test_bitmanip_main.cpp