SASAdv12月真题-63题.doc_咨信网zixin.com.cn

资源描述

1、1. When attempting to minimize memory usage, the most efficient way to do group processing when using the MEANS procedure is to use: A. the BY statement. B. GROUPBY with the NOTSORTED specification. C. the CLASS statement. D. multiple WHERE statements. (The question is pretty clear-cut, IMO. It allu

2、des to the standard long- bearded practical trick used in the situation when MEANS/SUMMARY with CLASS uses too much memory. It does not ask what is more efficient in general (which would be to use something else instead of MEANS if possible), just what leads to the minimal memory footprint, and the

3、answer is A. (Answer B is just a distraction - a common device in computer-adaptive tests). The reason is that without BY, MEANS creates and builds its AVL tree for the entire file and all categorical values crossings found in CLASS. With BY, the proc builds the tree for the current BY group only, c

4、omputes the stats, then erases the tree before the next BY group starts, creates and builds it again, and so on. The erase/create process is not a zero cost one, so the smaller the more numerous the BY groups are, the more it slows does the processing; however, the smaller is the largest BY group, t

5、he smaller is the memory footprint.)2. The SAS data set WORK.CHECK has a variable named Id_Code in it. Which SQL statement would create an index on this variable? A. create index Id_Code on WORK.CHECK; B. create index(Id_Code) on WORK.CHECK; C. make index=Id_Code from WORK.CHECK; D.define index(Id_C

6、ode) in WORK.CHECK; 3. Given the SAS data sets: WORK.EMPLOYEE WORK.NEWEMPLOYEE Name Dept Names Salary - - - - Alan Sales Michelle 50000 Michelle Sales Paresh 60000 A SAS program is submitted and the following is written to the SAS log: 101 proc sql; 102 select dept, name 103 from WORK.EMPLOYEE 104 w

7、here name=(select names from newemployee where salary 40000) ERROR: Subquery evaluated to more than one row. 105 ; 106 quit; What would allow the program to successfully execute without errors? A. Replace the where clause with: where EMPLOYEE.Name=(select Names delimited with , from WORK.NEWEMPLOYEE

8、 where Salary 40000); B. Replace line 104 with: where EMPLOYEE.Name =ANY (select Names separated with , from WORK.NEWEMPLOYEE where Salary 40000); （any表示要有多个值，而=只能要求一个值。） C. Replace the equal sign with the IN operator. D. Qualify the column names with the table names. 4. Given the SAS data set SASUS

9、ER.HIGHWAY: Steering Seatbelt Speed Status Count - - - - - absent No 0-29 serious 31 absent No 0-29 not 1419 absent No 30-49 serious 191 absent no 30-49 not 2004 absent no 50+ serious 216 The following SAS program is submitted: proc sql noprint; select distinct Speed _insert_SQL_clause_ from SASUSER

10、.HIGHWAY ; quit; title1 Speed values represented are: &GROUPS; proc print data=SASUSER.HIGHWAY; run; Which SQL clause stores the text 0-29, 30-49, 50+ in the macro variable GROUPS? A. into &GROUPS B. into :GROUPS C. into :GROUPS separated by , D. into &GROUPS separated by , (2012/11/17日考题中选项有改动，但是答案

11、没有变。)5. The SAS data set WORK.CHECK has an index on the variable Code and the following SAS program is submitted. proc sort data=WORK.CHECK; by Code; run; Which describes the result of submitting the SAS program? A. The index on Code is deleted. B. The index on Code is updated. C. The index on Code

12、is uneffected.D. The sort does not execute. （在于有了index的variable，不能再用by来SORT了。）6. The table WORK.PILOTS contains the following data: WORK.PILOTS Id Name Jobcode Salary - - - - 001 Albert PT1 50000 002 Brenda PT1 70000 003 Carl PT1 60000 004 Donna PT2 80000 005 Edward PT2 90000 006 Flora PT3 100000 Th

13、e data set was summarized to include average salary based on jobcode: Jobcode Salary Avg - - - PT1 50000 60000 PT1 70000 60000 PT1 60000 60000 PT2 80000 85000 PT2 90000 85000 PT3 100000 100000 Which SQL statement could NOT generate this result? A. select Jobcode, Salary, avg(Salary) label=Avgfrom WO

14、RK.PILOTSgroup by Jobcodeorder by Id ; B. select Jobcode, Salary, (select avg(Salary) from WORK.PILOTS as P1 where P1.Jobcode=P2.Jobcode) as Avg from WORK.PILOTS as P2 order by Id ; （这里利用的是自身的数据集，分别设置不同的别名，可以当做两个数据集来使用。不过这里WHERE语句很值得学习，我个人对这里的WHERE语句这样理解的：(select avg(Salary) from WORK.PILOTS as P1 w

15、here P1.Jobcode=P2.Jobcode)启示执行之后返回的是三个AVG均值，但是因为这三个AVG均值都是与JOBCODE匹配的，那么前面的select Jobcode, Salary，所以当执行到(select avg(Salary) from WORK.PILOTS as P1 where P1.Jobcode=P2.Jobcode)这里时，实际上是选择了select jobcode 中jobcode对应的avg均值。）C. select Jobcode, Salary, (select avg(Salary) from WORK.PILOTS group by Jobcode

16、) as Avg from WORK.PILOTSorder by Id ; (子查询涉及不止一项。)D. select Jobcode, Salary, Avg from WORK.PILOTS, (select Jobcode as Jc, avg(Salary) as Avg from WORK.PILOTS group by 1)where Jobcode=Jcorder by Id ; 7. A quick rule of thumb for the space required to run PROC SORT is: A. two times the size of the SA

17、S data set being sorted. B. three times the size of the SAS data set being sorted. C. four times the size of the SAS data set being sorted. D. five times the size of the SAS data set being sorted. (140-2010: Dear Miss SASAnswers: A Guide to Sorting Your Data 中的描述是这样的：If you want the sort to complete

18、 entirely in memory, a simple rule of thumb is four times the size of the data set.In releases prior to SAS 9, the required workspace is approximately three to fourtimes the size of the data file. Beginning with SAS 9, the required workspace isapproximately twice the size of the data file. The works

19、pace can be allocated in memory and/or on disk as a utility file, depending on which sort utility and options are specified.Dear MissSASAnswers, Can you please give me some guidance on how much memory it takes to sort a 2GB table Signed, Memory Deprived Dear Memory Deprived, If you want the sort to

20、complete entirely in memory, a simple rule of thumb is four times the size of the data set. And Im assuming the data set is not compressed or being subset with a DROP=/KEEP= data set option or a WHERE statement. A better estimate would be to use this formula to predict the amount of memory: (length

21、of observation+sum of lengths of BY variables)*number of observations)* 1.10The amount of space that the SAS sort needs depends on the following conditions: whether the sort can be done with threading the length of the observations the number of variables in the BY statement and their storage length

22、s the operating environment in which PROC SORT executes whether the LINGUISTIC= option is being used (that takes more memory) Use the SORTSIZE= option in the PROC SORT statement to do the following: specify the amount of memory that is available to the SORT procedure improve the sort performance by

23、restricting the swapping of memory to disk that is controlled by the operating system)8. Multi-threaded processing for PROC SORT will effect which of these system resources?A. CPU time will decrease, wall clock time will decrease B. CPU time will increase, wall clock time will decrease C. CPU time w

24、ill decrease, wall clock time will increase D. CPU time will increase, wall clock time will increase 9. Given the SAS data set WORK.TRANSACT: Rep Cost Ship - - - SMITH 200 50 SMITH 400 20 JONES 100 10 SMITH 600 100 JONES 100 5 The following output is desired: Rep - - JONES 105 SMITH 250 Which SQL st

25、atement was used? A. select rep, min(Cost+Ship)from WORK.TRANSACT order by Rep ; B. select Rep, min(Cost,Ship) as Minfrom WORK.TRANSACT summary by Reporder by Rep ; C. select Rep, min(Cost,Ship)from WORK.TRANSACT group by Rep order by Rep ; （改成min(sum(cost,ship)才对） D. select Rep, min(Cost+Ship)from

26、WORK.TRANSACT group by Rep order by Rep ; 10. The following SAS program is submitted: %let Value=9; %let Add=5; %let Newval=%eval(&Value/&Add); %put &Newval; What is the value of the macro variable Newval when the %PUT statement executes? A. 0.555B. 2 C. 1.8 D. 1 11. The following SAS code is submit

27、ted: data WORK.TEMP WORK.ERRORS / view=WORK.TEMP; infile RAWDATA; input Xa Xb Xc; if Xa=. then output WORK.ERRORS; else output WORK.TEMP; run; Which of the following is true of the WORK.ERRORS data set?A. The data set is created when the DATA step is submitted.B. The data set is created when the vie

28、w TEMP is used in another SAS step.C. The data set is not created because the DATA statement contains a syntax error. D. The descriptor portion of WORK.ERRORS is created when the DATA step is submitted. （A DATA step view cannot contain global statements, host-specific data set options, or most host-

29、specific FILE and INFILE statements. Also, a DATA step view cannot be indexed or compressed.假设在d盘建SAS文件夹下有view.txt文件：. 10 1110 11 12. 10 1221 22 23在SAS编程如下：filename rawdata d:sasview.txt;data WORK.TEMP WORK.ERRORS / view=WORK.TEMP; infile rawdata;input Xa Xb Xc;if Xa=. then output WORK.ERRORS; else

30、output WORK.TEMP; run; proc print data=temp;run;这道题很有味道。如果rawdata是在SAS安装系统的默认地址下，那么加上分号就可以了。）12. Which title statement would always display the current date? A. title Today is: &sysdate.; B. title Today is: &sysdate9.; C. title Today is: &today.; D. title Today is: %sysfunc(today(),worddate.); 13. G

31、iven the SAS data sets: WORK.ONE WORK.TWO Id Name Id Salary - - - - 112 Smith 243 150000 243 Wei 355 45000 457 Jones 523 75000 The following SAS program is submitted: data WORK.COMBINE; merge WORK.ONE WORK.TWO; by Id; run; ( like horizontal full join ) Which SQL procedure statement produces the same

32、 results? A. create table WORK.COMBINE as select Id, Name, Salary from WORK.ONE full join WORK.TWO on ONE.Id=TWO.Id; B.create table WORK.COMBINE as select coalesce(ONE.Id, TWO.Id) as Id, Name, Salary from WORK.ONE, WORK.TWO where ONE.Id=TWO.Id ; C. create table WORK.COMBINE as select coalesce(ONE.Id

33、, TWO.Id) as Id, Name, Salary from WORK.ONE full join WORK.TWO on ONE.Id=TWO.Idorder by Id ; D. create table WORK.COMBINE as select coalesce(ONE.Id, TWO.Id) as Id, Name, Salary from WORK.ONE, WORK.TWO where ONE.Id=TWO.Id order by ONE.Id ; 14. The following SAS program is submitted: proc contents dat

34、a=TESTDATA.ONE; run; Which SQL procedure step produces similar information about the column attributes of TESTDATA.ONE? A. proc sql; contents from TESTDATA.ONE; quit; B. proc sql; describe from TESTDATA.ONE; quit; C. proc sql; contents table TESTDATA.ONE; quit; D. proc sql; describe table TESTDATA.O

35、NE; quit; 15. Given the SAS data set WORK.ONE: Rep Cost - - SMITH 200 SMITH 400 JONES 100 SMITH 600 JONES 100 The following SAS program is submitted; proc sql; select Rep, avg(Cost) from WORK.ONE order by Rep ; quit; Which result set would be generated? A.JONES 280 JONES 280 SMITH 280 SMITH 280 SMIT

36、H 280 B.JONES 600 SMITH 100 C.JONES 280 SMITH 280 D.JONES 100 JONES 100 SMITH 600 SMITH 600 SMITH 600 16. Given the SAS data sets: WORK.MATH1A WORK.MATH1B Name Fi Name Fi - - - - Lauren L Smith M Patel A Lauren L Chang Z Patel A Hillier R The following SAS program is submitted: proc sql; select * fr

37、om WORK.MATH1A _insert_set_operator_ select * from WORK.MATH1B ; quit; The following output is desired: Name Fi - - Lauren L Patel A Chang Z Hillier R Smith M Lauren L Patel A Which SQL set operator completes the program and generates the desired output? A. append corr B. union corr（select unique ro

38、ws）C. outer union corr (using corr to overlay by the same names)D. intersect corr 17. Which of the following is an advantage of SAS views? A. SAS views can access the most current data in files that are frequently updated.B. SAS views can avoid storing a SAS copy of a large data file. C. SAS views can decrease programming time. D. both A and B are true 18. In what order does SAS search for format definitions by default?A. 1. WORK.FORMATS 2. LIBRARY.FORMATS B. 1. LIBRARY.FORMATS 2. WO

展开阅读全文