|
马上注册,结交更多好友,享用更多功能,让你轻松玩转社区。
您需要 登录 才可以下载或查看,没有账号?注册
x
经过几天的试验和努力,同时得到鸟哥的指点,终于装好了这个东西。为了不至于马上忘记,赶紧将如何安装记录下来,让更多的人掌握这个技术文档。同时在此非常感谢鸟哥(vbird)。
我们的平台:
AMD4000+
MPI1.4.7
PGI
1、到 https://sourceforge.net/project/showfiles.php?group_id=23725 下载 atlas3.6.0.tar.gz 文档
2、到 http://www.netlib.org/benchmark/hpl/index.html 下载 hpl.tgz 文档
3、在普通用户下用命令 tar -zxvf **将两个文件解压,就会看见ATLAS和hpl两个文件夹。
4、现安装atlas,进入ATLAS,用make config CC=/usr/mpich/bin/mpicc(CC所等的是编译器地址,各台机器不尽相同,我试了gcc好像不行),然后会出一些选择,一般默认即可,注意architecture这里的多少位的选择,一般先在新出的大多为64位的。我的是64位,architecture用默认的Linux_HAMMER64SSE2_2。
5、然后就对Make..Linux_HAMMER64SSE2_2作一定的修改,主要是对编译器路径的修改,见附件。
6、修改好以后,make install arch=Linux_HAMMER64SSE2_2,一般就会成功。如果失败,建议删除整个ATLAS,再从头来,每一步都要更仔细就好,多试几下是有好处的。
附录:
# -----------------------------
# Make.ARCH for ATLAS3.6.0
# -----------------------------
# ----------------------------------
# Make sure we get the correct shell
# ----------------------------------
SHELL = /bin/sh
# -------------------------------------------------
# Name indicating the platform to configure BLAS to
# -------------------------------------------------
ARCH = Linux_HAMMER64SSE2_2
# -------------------
# Various directories
# -------------------
TOPdir = /home/tfbao/ATLAS
INCdir = $(TOPdir)/include/$(ARCH)
SYSdir = $(TOPdir)/tune/sysinfo/$(ARCH)
GMMdir = $(TOPdir)/src/blas/gemm/$(ARCH)
UMMdir = $(GMMdir)
GMVdir = $(TOPdir)/src/blas/gemv/$(ARCH)
GR1dir = $(TOPdir)/src/blas/ger/$(ARCH)
L1Bdir = $(TOPdir)/src/blas/level1/$(ARCH)
L2Bdir = $(TOPdir)/src/blas/level2/$(ARCH)
L3Bdir = $(TOPdir)/src/blas/level3/$(ARCH)
TSTdir = $(TOPdir)/src/testing/$(ARCH)
AUXdir = $(TOPdir)/src/auxil/$(ARCH)
CBLdir = $(TOPdir)/interfaces/blas/C/src/$(ARCH)
FBLdir = $(TOPdir)/interfaces/blas/F77/src/$(ARCH)
BINdir = $(TOPdir)/bin/$(ARCH)
LIBdir = $(TOPdir)/lib/$(ARCH)
PTSdir = $(TOPdir)/src/pthreads
MMTdir = $(TOPdir)/tune/blas/gemm/$(ARCH)
MVTdir = $(TOPdir)/tune/blas/gemv/$(ARCH)
R1Tdir = $(TOPdir)/tune/blas/ger/$(ARCH)
L1Tdir = $(TOPdir)/tune/blas/level1/$(ARCH)
L3Tdir = $(TOPdir)/tune/blas/level3/$(ARCH)
# ---------------------------------------------------------------------
# Name and location of scripts for running executables during tuning
# ---------------------------------------------------------------------
ATLRUN = $(BINdir)/ATLrun.sh
ATLFWAIT = $(BINdir)/xatlas_waitfile
# ---------------------
# Libraries to be built
# ---------------------
ATLASlib = $(LIBdir)/libatlas.a
CBLASlib = $(LIBdir)/libcblas.a
F77BLASlib = $(LIBdir)/libf77blas.a
PTCBLASlib = $(LIBdir)/libptcblas.a
PTF77BLASlib = $(LIBdir)/libptf77blas.a
LAPACKlib = $(LIBdir)/liblapack.a
TESTlib = $(LIBdir)/libtstatlas.a
# -------------------------------------------
# Upper bound on largest cache size, in bytes
# -------------------------------------------
L2SIZE = -DL2SIZE=1048576
# ---------------------------------------
# Command setting up correct include path
# ---------------------------------------
INCLUDES = -I$(TOPdir)/include -I$(TOPdir)/include/$(ARCH) \
-I$(TOPdir)/include/contrib
# -------------------------------------------
# Defines for setting up F77/C interoperation
# -------------------------------------------
F2CDEFS = -DAdd__ -DStringSunStyle
# --------------------------------------
# Special defines for user-supplied GEMM
# --------------------------------------
UMMDEFS =
# ------------------------------
# Architecture identifying flags
# ------------------------------
ARCHDEFS = -DATL_OS_Linux -DATL_ARCH_HAMMER64 -DATL_SSE2 -DATL_SSE1 -DATL_GAS_x8664 -m64
# -------------------------------------------------------------------
# NM is the flag required to name a compiled object/executable
# OJ is the flag required to compile to object rather than executable
# These flags are used by all compilers.
# -------------------------------------------------------------------
NM = -o
OJ = -c
# ---------------------------------------------------------------------------
# Fortran 77 compiler and the flags to use. Presently, ATLAS does not itself
# use any Fortran 77, but vendor BLAS are typically written for Fortran, so
# any links that include non-ATLAS BLAS will use FLINKER instead of CLINKER
# ---------------------------------------------------------------------------
F77 = /usr/mpich/bin/mpif77
F77FLAGS = -fomit-frame-pointer -O -m64
FLINKER = $(F77)
FLINKFLAGS = $(F77FLAGS)
FCLINKFLAGS = $(FLINKFLAGS)
# ---------------------------------------------------------------------------
# Various C compilers, and the linker to be used when we are not linking in
# non-ATLAS BLAS (which usually necessitate using the Fortran linker).
# The C compilers recognized by ATLAS are:
# CC : Compiler to use to compile regular, non-generated code
# MCC : Compiler to use to compile generated, highly-optimized code
# XCC : Compiler to be used on the compile engine of a cross-compiler
# These will typically all be the same. An example of where this is not
# the case would be DEC ALPHA 21164, where you want to use gcc for MCC,
# because DEC';s cc does not allow the programmer access to all 32 floating
# point registers. However, on normal C code, DEC';s cc produces much faster
# code than gcc, so you CC set to cc. Of course, any system where you are
# cross-compiling, you will need to set XCC differently than CC & MCC.
# ---------------------------------------------------------------------------
CDEFS = $(L2SIZE) $(INCLUDES) $(F2CDEFS) $(ARCHDEFS) -DATL_NCPU=2
GCCdir = /usr/bin
GOODGCC = /usr/mpich/bin/mpicc
CC = /usr/mpich/bin/mpicc
CCFLAG0 = -fomit-frame-pointer -O -mfpmath=387 -m64
CCFLAGS = $(CDEFS) $(CCFLAG0)
MCC = /usr/mpich/bin/mpicc
MMFLAGS = -fomit-frame-pointer -O -mfpmath=387 -m64
XCC = /usr/mpich/bin/mpicc
XCCFLAGS = $(CDEFS) -fomit-frame-pointer -O -mfpmath=387 -m64
CLINKER = $(CC)
CLINKFLAGS = $(CCFLAGS)
BC = $(CC)
BCFLAGS = $(CCFLAGS)
ARCHIVER = ar
ARFLAGS = r
RANLIB = echo
# -------------------------------------
# tar, gzip, gunzip, and parallel make
# -------------------------------------
TAR = /bin/tar
GZIP = /bin/gzip
GUNZIP = /bin/gunzip
PMAKE = $(MAKE) -j 2
# ------------------------------------
# Reference and system libraries
# ------------------------------------
BLASlib =
FBLASlib =
FLAPACKlib =
LIBS = -lpthread -lm
# ----------------------------------------------------------
# ATLAS install resources (include arch default directories)
# ----------------------------------------------------------
ARCHDEF = $(TOPdir)/CONFIG/ARCHS/HAMMER64SSE2/gcc/misc
MMDEF = $(TOPdir)/CONFIG/ARCHS/HAMMER64SSE2/gcc/gemm
INSTFLAGS =
# ---------------------------------------
# Generic targets needed by all makefiles
# ---------------------------------------
waitfile:
前面说了atlas之安装,现在紧接着hpl之安装,这里基本没什么好说的,
1、按照鸟哥的说法,“If you use the ';atlas'; library, I think you should use this file ';Make.Linux_ATHLON_CBLAS';”,我们将hpl/setup/下的文件Make.Linux_ATHLON_CBLAS 复制到hpl/ 下
2、然后对Makefile和Make.Linux_ATHLON_CBLAS几处作修改,一个是Makefile里将arch的值改为';arch = Linux_ATHLON_CBLAS';,而对Make.Linux_ATHLON_CBLAS的修改较多,详情见附件。
3、make即可完成安装
4、如果安装成功,那么在hpl/bin/Linux_HAMMER64SSE2_2下就会看见HPL.dat 和xhpl两个文件,前者是数据设置,后者是执行文件。如果不成功,反复研究你的设置,再试几次就可能会好得。不过说实话,我也很碰运气而以,就我重新装,不一定马上能装好。
至于测试的具体方法,我也不知道,有时间的话后面会给出一些,或者干脆转载一些,不过我实在不乐意转载别人的文章,说实话大多数没有经过测试,转来转去没有一点用,这里保证我们的技术文章都是经过测试的,这样实用价值更大一些。
附件1:
#
# -- High Performance Computing Linpack Benchmark (HPL)
# HPL - 1.0a - January 20, 2004
# Antoine P. Petitet
# University of Tennessee, Knoxville
# Innovative Computing Laboratories
# (C) Copyright 2000-2004 All Rights Reserved
#
# -- Copyright notice and Licensing terms:
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions
# are met:
#
# 1. Redistributions of source code must retain the above copyright
# notice, this list of conditions and the following disclaimer.
#
# 2. Redistributions in binary form must reproduce the above copyright
# notice, this list of conditions, and the following disclaimer in the
# documentation and/or other materials provided with the distribution.
#
# 3. All advertising materials mentioning features or use of this
# software must display the following acknowledgement:
# This product includes software developed at the University of
# Tennessee, Knoxville, Innovative Computing Laboratories.
#
# 4. The name of the University, the name of the Laboratory, or the
# names of its contributors may not be used to endorse or promote
# products derived from this software without specific written
# permission.
#
# -- Disclaimer:
#
# THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
# ``AS IS';'; AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
# LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
# A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE UNIVERSITY
# OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
# SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
# LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
# DATA OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
# THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
# (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
# OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
# ######################################################################
#
#
SHELL = /bin/sh
#
arch = Linux_ATHLON_CBLAS
#
## Targets #############################################################
#
all : install
#
# ######################################################################
#
install : startup refresh build
#
startup :
$(MAKE) -f Make.top startup_dir arch=$(arch)
$(MAKE) -f Make.top startup_src arch=$(arch)
$(MAKE) -f Make.top startup_tst arch=$(arch)
$(MAKE) -f Make.top refresh_src arch=$(arch)
$(MAKE) -f Make.top refresh_tst arch=$(arch)
#
refresh :
$(MAKE) -f Make.top refresh_src arch=$(arch)
$(MAKE) -f Make.top refresh_tst arch=$(arch)
#
build :
$(MAKE) -f Make.top build_src arch=$(arch)
$(MAKE) -f Make.top build_tst arch=$(arch)
#
clean :
$(MAKE) -f Make.top clean_src arch=$(arch)
$(MAKE) -f Make.top clean_tst arch=$(arch)
#
clean_arch :
$(MAKE) -f Make.top clean_arch_src arch=$(arch)
$(MAKE) -f Make.top clean_arch_tst arch=$(arch)
#
clean_arch_all :
$(MAKE) -f Make.top clean_arch_all arch=$(arch)
#
clean_guard :
$(MAKE) -f Make.top clean_guard_src arch=$(arch)
$(MAKE) -f Make.top clean_guard_tst arch=$(arch)
#
# ######################################################################
附件2:#
# -- High Performance Computing Linpack Benchmark (HPL)
# HPL - 1.0a - January 20, 2004
# Antoine P. Petitet
# University of Tennessee, Knoxville
# Innovative Computing Laboratories
# (C) Copyright 2000-2004 All Rights Reserved
#
# -- Copyright notice and Licensing terms:
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions
# are met:
#
# 1. Redistributions of source code must retain the above copyright
# notice, this list of conditions and the following disclaimer.
#
# 2. Redistributions in binary form must reproduce the above copyright
# notice, this list of conditions, and the following disclaimer in the
# documentation and/or other materials provided with the distribution.
#
# 3. All advertising materials mentioning features or use of this
# software must display the following acknowledgement:
# This product includes software developed at the University of
# Tennessee, Knoxville, Innovative Computing Laboratories.
#
# 4. The name of the University, the name of the Laboratory, or the
# names of its contributors may not be used to endorse or promote
# products derived from this software without specific written
# permission.
#
# -- Disclaimer:
#
# THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
# ``AS IS';'; AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
# LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
# A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE UNIVERSITY
# OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
# SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
# LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
# DATA OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
# THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
# (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
# OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
# ######################################################################
#
# ----------------------------------------------------------------------
# - shell --------------------------------------------------------------
# ----------------------------------------------------------------------
#
SHELL = /bin/sh
#
CD = cd
CP = cp
LN_S = ln -s
MKDIR = mkdir
RM = /bin/rm -f
TOUCH = touch
#
# ----------------------------------------------------------------------
# - Platform identifier ------------------------------------------------
# ----------------------------------------------------------------------
#
ARCH = Linux_ATHLON_CBLAS
#
# ----------------------------------------------------------------------
# - HPL Directory Structure / HPL library ------------------------------
# ----------------------------------------------------------------------
#
TOPdir = /home/tfbao/hpl
INCdir = $(TOPdir)/include
BINdir = $(TOPdir)/bin/$(ARCH)
LIBdir = $(TOPdir)/lib/$(ARCH)
#
HPLlib = $(LIBdir)/libhpl.a
#
# ----------------------------------------------------------------------
# - MPI directories - library ------------------------------------------
# ----------------------------------------------------------------------
# MPinc tells the C compiler where to find the Message Passing library
# header files, MPlib is defined to be the name of the library to be
# used. The variable MPdir is only used for defining MPinc and MPlib.
#
MPdir = /usr/mpich
MPinc = -I$(MPdir)/include
MPlib = $(MPdir)/lib/libmpich.a
#
# ----------------------------------------------------------------------
# - Linear Algebra library (BLAS or VSIPL) -----------------------------
# ----------------------------------------------------------------------
# LAinc tells the C compiler where to find the Linear Algebra library
# header files, LAlib is defined to be the name of the library to be
# used. The variable LAdir is only used for defining LAinc and LAlib.
#
LAdir = $(HOME)/ATLAS/lib/Linux_HAMMER64SSE2_2
LAinc =
LAlib = $(LAdir)/libcblas.a $(LAdir)/libatlas.a
#
# ----------------------------------------------------------------------
# - F77 / C interface --------------------------------------------------
# ----------------------------------------------------------------------
# You can skip this section if and only if you are not planning to use
# a BLAS library featuring a Fortran 77 interface. Otherwise, it is
# necessary to fill out the F2CDEFS variable with the appropriate
# options. **One and only one** option should be chosen in **each** of
# the 3 following categories:
#
# 1) name space (How C calls a Fortran 77 routine)
#
# -DAdd_ : all lower case and a suffixed underscore (Suns,
# Intel, ...), [default]
# -DNoChange : all lower case (IBM RS6000),
# -DUpCase : all upper case (Cray),
# -DAdd__ : the FORTRAN compiler in use is f2c.
#
# 2) C and Fortran 77 integer mapping
#
# -DF77_INTEGER=int : Fortran 77 INTEGER is a C int, [default]
# -DF77_INTEGER=long : Fortran 77 INTEGER is a C long,
# -DF77_INTEGER=short : Fortran 77 INTEGER is a C short.
#
# 3) Fortran 77 string handling
#
# -DStringSunStyle : The string address is passed at the string loca-
# tion on the stack, and the string length is then
# passed as an F77_INTEGER after all explicit
# stack arguments, [default]
# -DStringStructPtr : The address of a structure is passed by a
# Fortran 77 string, and the structure is of the
# form: struct {char *cp; F77_INTEGER len;},
# -DStringStructVal : A structure is passed by value for each Fortran
# 77 string, and the structure is of the form:
# struct {char *cp; F77_INTEGER len;},
# -DStringCrayStyle : Special option for Cray machines, which uses
# Cray fcd (fortran character descriptor) for
# interoperation.
#
F2CDEFS =
#
# ----------------------------------------------------------------------
# - HPL includes / libraries / specifics -------------------------------
# ----------------------------------------------------------------------
#
HPL_INCLUDES = -I$(INCdir) -I$(INCdir)/$(ARCH) $(LAinc) $(MPinc)
HPL_LIBS = $(HPLlib) $(LAlib) $(MPlib)
#
# - Compile time options -----------------------------------------------
#
# -DHPL_COPY_L force the copy of the panel L before bcast;
# -DHPL_CALL_CBLAS call the cblas interface;
# -DHPL_CALL_VSIPL call the vsip library;
# -DHPL_DETAILED_TIMING enable detailed timers;
#
# By default HPL will:
# *) not copy L before broadcast,
# *) call the Fortran 77 BLAS interface
# *) not display detailed timing information.
#
HPL_OPTS = -DHPL_CALL_CBLAS
#
# ----------------------------------------------------------------------
#
HPL_DEFS = $(F2CDEFS) $(HPL_OPTS) $(HPL_INCLUDES)
#
# ----------------------------------------------------------------------
# - Compilers / linkers - Optimization flags ---------------------------
# ----------------------------------------------------------------------
#
CC = /usr/mpich/bin/mpicc
CCNOOPT = $(HPL_DEFS)
CCFLAGS = $(HPL_DEFS) -fomit-frame-pointer -O3 -funroll-loops -W -Wall
#
LINKER = /usr/mpich/bin/mpicc
LINKFLAGS = $(CCFLAGS)
#
ARCHIVER = ar
ARFLAGS = r
RANLIB = echo
#
# ----------------------------------------------------------------------
|
|