Exam 2, Biology 301D, 17 October 2001 Printed name ________________

(2pts) Social security number and name. Bubble in the scantron with your name and your social security number (your SSN goes(in the first 9 bubbles of the scantron ID field). (For good measure, put your name on this hard copy too, in case your key code does not carry over.)

(2pts) Key Code: One of the exam questions will indicate a key code. Make sure you fill those in as indicated on the question indicated.

When finished, turn in both the Scantron and hard copy. You may write on this hard copy, but your grade will be determined by the Scantron form.

None, one, all, or any combination of individual answers may apply to a question unless stated otherwise.

Radiation

1-3. Which of the following statements are currently accepted (from lecture and the book). (None, one or many)

1. (4 points). Which are true? (easy)

(A) High doses are used as models of low doses when calculating the cancer risk from radiation exposure

(B) The four abstract models of the relationship between cancer rate and radiation exposure (i.e., the graphical models: linear, threshold, accelerating, decelerating) all exhibit the general property that the highest cancer rates occur at the highest exposures.

(C) The use of a single measure of radiation exposure for different types of radiation (rem or rad) is a false model, at least because the different types of radiation have different biological effects.

(D) The physical models underlying our understanding of the cancer risk caused by radiation include (i) different types of cancer, (ii) different types of radiation, and (iii) different doses.

2. (4 points). Which are true? (medium difficulty)

(A) The type of cancer most commonly studied in conjunction with radiation has been solid-tumor cancers (e.g., lung, colon) because of its relatively short time-lag between radiation exposure and appearance of the cancer (5 years).

(B) Mice have been used as a model of humans to test which of the abstract models (linear, threshold, ...) best fits the relationship between cancer and radiation because this relationship appears to be insensitive to differences between mice and humans.

(C) Living in Denver increases your background level of radiation from at least two sources (cosmic, rocks) compared to living in Austin.

(D) We do not try to estimate elevated cancer rates from people given medical X-rays on a yearly basis because the effects of this excess radiation are too small to measure.

3. (5 points). Which are true? (hardest)

(A) Smoking increases radiation levels to the lungs by 80X-100X.

(B) Recent work suggests that low levels of radiation may actually reduce cancer rate (radiation hormesis).

(D) The radioactive “mom” advertisement warned about the dangers of exposure to high levels of radiation in some waste sites.

(E) If a million people were exposed to a single dose of radiation equivalent to 10 times the US average background level, the yearly number of excess cancer deaths from that exposure would be at least several hundred, starting a few years after the exposure.

Science and the Law

In the following questions, “fingerprints” refers to the prints made by your fingers, not to DNA typing.

4. (4 pts) Which of the following are true? (mostly facts)

(A) Since the death penalty was reinstated in 1976, slightly more than 1% of people sentenced to die have been released because they were later shown to be innocent.

(B) The use of fingerprint evidence in US courts spanned nearly 90 years in the 1900s. Into the 1990s, the main professional organization of fingerprint experts in the US resisted the use of objective criteria for deciding whether two fingerprints matched.

(C) The use of hair matching (by methods that do not involve DNA) has been bolstered recently by the creation of a database that encodes hair characteristics from over 150,000 different hair samples

(D) The use of dog olfaction (sniffing) to identify items of clothing belonging to specific individuals has no meaningful basis (is nonsense).

5. (5 pts) Which of the following are true? (facts connected to course themes)

(A) A proficiency test is a test involving the use of standards. It establishes a human and technical error rate.

(B) When tests of fingerprint experts were finally conducted in the 1990s, false positive rates by fingerprint experts were found to be 1% or less.

(C) The routine use of DNA tests before trial in the last few years of criminal investigation has shown that in approximately 10% of cases, the prosecution’s “prime” suspect was the wrong person.

(D) Of the 4 types of errors (RPA, sampling, human & technical, bias), the types most common in leading to wrongful convictions are bias and human & technical.

6. (4 pts) Which of the following are true? (more facts connected to course themes)

(A) In 62 cases of convictions that were later overturned by DNA evidence (you may accept that 62 is correct), the most common factor leading to wrongful conviction was mistaken identity.

(B) Unintentional bias is only a minor problem in witness recognition of a suspect, because even though witnesses can make mistakes, those witnesses who identified the wrong person have done so in their first attempts at identification and were not influenced by subtle factors introduced after the investigation began.

(D) The use of blind samples in lab tests is warranted chiefly to avoid fraud and is not a contributor to unintentional bias, because honest technicians are not influenced by knowing sample identity.

Data: Error

(7-10). (3 pts each) For each of the following descriptions, indicate the types of error present (the underlined phrase identifies the error). Mark a type of error only if it is definitely present. Do not assume any more than what is explicitly mentioned in the problem. One answer only for each question, though an option may be used to answer more than one question.

	(A) Rounding, Precision and accuracy
Types of Error:	(B) Sampling
	(C) Human and technical
	(D) Bias
	(E) No error is indicated

7. (3 pts) Time magazine once reported on an amusing issue with a toilet paper company. Company ads boasted of at least 500 sheets per roll, but someone had determined that the rolls were a few sheets short. This discovery led to meetings of company officials, with lots of hand-wringing. Then someone else went and counted the number of sheets and found that 500 was correct. The person who had carried out the first “count” had measured the length of a few sheets to determine the length per sheet. He then measured the length of the entire roll and divided by his estimated length per sheet to estimate the number of sheets in a roll. His number came up short because his length per sheet was not exact. What kind of error is represented by the fact that his measure of the length of a single sheet was not good enough to calculate the exact number of sheets from the length of the roll? (one only)

(A) (B) (C) (D) (E)

8. (3 pts) John and Paul race on foot for 100 meters to determine who is fastest. They hire an atheletic service to photograph the race and time each of them. They run together, side-by-side, and John wins the race, arriving at the finish line 2 meters ahead and 0.3 seconds ahead of Paul. What kind of data error is indicated by John’s time being shorter (John’s running faster) than Paul’s? (one answer only)

(A) (B) (C) (D) (E)

9. (3 pts)? I want to know what fraction of students enrolled in class actually show up for lecture, so that the class enrollment limit can be set as high as possible but without overfilling the classroom during lecture. (We will assume that exams are given in a different room). A count of students in class one day reveals 103 attending. I contemplate using that number but then deicide that the number may not be subject to error. So I count the number of students attending during the next two lectures and observe 82 and 150. Although my counts on each day may differ slightly from the actual numbers, these differences between days are real. What kind of error is revealed by the variation between days in the number of students in class as a model of how many students attend lecture? (one answer only)

(A) (B) (C) (D) (E)

10. (3 pts) The supermarket at which you shop sometimes rings up a different price at the checkout stand than the price listed in the isle for that item. You decide to do a brief study of this pattern. You detect 6 instances of price discrepancies, and 5 of them involve a higher price at checkout. Feeling incensed about this, you do even more research and find 15 more cases of price differences, and 13 of those are higher at the checkout than in the isle. What type of error is indicated by the fact that, when there is a price discrepancy, the price at checkout is usually the higher one ? (one answer only)

(A) (B) (C) (D) (E)

Ideal Data

(11-17). For each of the following statements, mark the appropriate letters that describe the data design features present. Base your answer only on the information given. That is, mark a data feature only if it is explicitly present in the problem description. One or many bubbles (unless indicated).

(A) explicit protocol	(C) standards	(E) blind
(B) replicates	(D) random	(F) none

11. (2 pts). Which features are important in reducing bias? (A) (B) (C) (D) (E) (F)

12. (2 pts). Which features are important in reducing/detecting human and technical error? (A) (B) (C) (D) (E) (F)

13. (2 pts). Which features are important in reducing sampling error? (A) (B) (C) (D) (E) (F)

14. (3 pts) A pesticide is developed by a chemical company. To test its effectiveness at killing crop pests, the company conducts a trial following the procedure of a trial conducted in the previous year: the pesticide is applied to four plots, half of which are heavily infested with insect pests. After 2 days, a farmer evaluates only the plots that were heavily infested at the outset.

(A) (B) (C) (D) (E) (F)

15. (3 pts) A farmer wants to know if using fertilizer will increase the amount of wheat harvested. To test this hypothesis, he divides his field into 64 plots. Each of these plots is a square 200m x 200m. To determine whether or not to use fertilizer in a plot, a coin is flipped. At the end of summer, the yield of wheat is measured in each plot. To determine that his machinery is harvesting the actual amount of wheat per plot, he harvests half of one plot by hand and compares that yield with the yield from the machine's harvest of the other half

(A) (B) (C) (D) (E) (F)

16. (3 pts) Scientists often make a distinction between "objective" data which typically requires little interpretation by the person gathering the data (such as a student's score on a multiple choice exam), and "subjective" data which may involve substantial interpretation (such as a student's score on an essay exam). Which ideal data feature is often required in order for subjective data to be reliable, but is often not necessary for objective data to be reliable? Exactly one answer is correct.

(A) (B) (C) (D) (E) (F)

17. (3 pts) A middle school student decides to determine if Fridays on the 13th day of the month are unlucky. She solicits records from emergency-room hospital admissions and police arrest reports for all four Friday-the-13ths during a two year period. To establish a base-line level of expected admissions and arrests, she also obtains records from hospitals and police for the Fridays immediately before and after each Friday-the-13th. To guard against errors in her analysis of the data, she has the analysis repeated by a second person, who also does not know which group of admissions is for Friday-the-13th and which is not.

(A) (B) (C) (D) (E) (F)

18-20. Do-it-yourself protocol. You are conducting an external review/test of a DNA lab. Your job is to send two tubes to the lab, with labels. There are several options for the content of and label on a tube. You must decide which contents to send and how to label the tubes so that the features of ideal data requested in the question are present from the lab's perspective. If a tube has a person's name on it, the lab can assume that the tube contents belong to the name of the person on the label. If a tube is labeled with a number, the contents are unknown to the lab but known to you. Your options for tube contents and tube labels are:

option	tube label	Contents in the tube are from	Blood type	Gender
(A)	Sam Edwards	Sam Edwards	AB	Male
(B)	Holly Berry	Holly Berry	O	Female
(C)	Marty Ray	Marty Ray	O	Male
(D)	#13	Wayne Crill	A	Male
(E)	#21	Wayne Crill	A	Male
(F)	#100	June Scott	B	Female
(G)	#243	Linda Davis	O	Female
(H)	#17	Joel Sachs	AB	Male
(I) No combination of tubes can satisfy the protocol

In the following questions, choose two letters among options (A)-(H) to describe the two tubes that will be sent to the lab. If it is possible to satisfy the protocol, the question will require exactly two letters and only two letters -- one for each tube. Thus, the answer for a question might be (A) & (B), or it might be (D) & (F). If more than one pair of options are possible correct answers, fill in only one correct pair of options. Thus, if (A) & (B) is one acceptable answer, and (C) & (D) is another acceptable answer, fill in either (A)&(B) or (C)&(D), but not both. If a factor (such as identity, blood type, gender) is not specified in the protocol, then that factor will be ignored in grading the answer. Alternatively, if a protocol cannot be satisfied with options for tubes (A)-(H), fill in (I).

18. (2 pts) Choose two tubes to guarantee replication of blood type but not for individual.

two tubes: (A) (B) (C) (D) (E) (F) (G) (H) (I)

19. (2 pts) The lab has a way to identify the gender (sex) of a sample. Make the protocol fully blind to the lab and replicated for gender but not replicated for an individual or blood type.

two tubes: (A) (B) (C) (D) (E) (F) (G) (H) (I)

20. (2 pts) Make the protocol fully blind to the lab and not replicated in any way (neither gender, blood type, nor individual).

two tubes: (A) (B) (C) (D) (E) (F) (G) (H) (I)

Drug Testing

(21-23). Each of items (A) through (J) describes a feature of Department of Transportation drug tests. The details of the drug testing rules are presented in the Federal Register 49 CFR PART 40 and described in a 100+ page pamphlet "Guidelines for Implementing the FHWA anti-drug program."

(A) Each manager whose employees are tested is required to include 3 known blank samples (lacking drugs) for every 100 unknowns; managers with more than 2000 drivers must also include some samples known to contain drugs.

(B) The laboratory initially tests all samples with a rapid screen; positive samples are retested more comprehensively.

(C) When a sample is sent to the laboratory for testing, it is labeled with a code, rather than the name of the person being tested.

(D) Split sample collection is required -- the partitioning of the original sample into two vials -- so that the second sample can be retained for retesting.

(E) The rules specify that any retesting (in the event of a positive sample) is to be done by a different laboratory than did the original test.

(F) The regulations require that any driver involved in an accident be tested for drugs and alcohol.

(G) Lab results are sent to the medical review officer before being returned to the office originating the test.

21. (3 pts) Which feature(s) represent standards in some part of the drug-testing protocol? (none, one, or many)

(A) (B) (C) (D) (E) (F) (G)

22. (3 pts) Which feature(s), if followed, ensure that DOT drug tests of an individual's sample are done blind? (none, one, or many)

(A) (B) (C) (D) (E) (F) (G)

23. (3 pts) Which feature(s) provide replication to guard against human and technical error? (none, one, or many)

(A) (B) (C) (D) (E) (F) (G)

DNA Typing

(24-26) Which of the following violations of ideal data were discussed in class or the book? An option is correct only if the example was discussed AND it is a valid example of the type of violation listed. For example, an option that listed the failure to replicate a test as a violation of randomization would not be considered correct. In most cases, you will need to rely on information from the book and lecture to provide details needed to answer the question.

24. (5 pts). Which are true? (general questions about DNA typing)

(A) When done properly, DNA results have a low probability of producing a chance (random) match between the forensic sample and a person not associated with the crime. This probability is often less than 1/million.

(B) DNA can be extracted from most tissues and from only trace amounts. For example, enough DNA can be recovered from the back of a postage stamp to determine whether a particular suspect licked the stamp.

(D) Human and technical error: the best guess of current errors due to such problems as sample mixup is a few percent (e.g., 2%).

25. (5 pts). Which are true? (questions about individual cases in DNA typing)

(A) Violation of explicit protocol: the lab in the Castro case violated its own measurement criteria for declaring a match

(B) Lack of blind: the Castro case revealed one or more instances in which knowledge of the sample identity likely led to technicians ignoring inconsistencies in the data.

(C) Lack of blind: letters from the Chicago Police Dept. requesting DNA testing from the FBI lab provided suggestive information about the suspect’s guilt.

(D) Lack of randomization: In the Castro case, the failure to randomize the order in which standards and Castro’s sample were scored caused the bar code to be misread in one test.

Evaluation and Interpretation

26. (8 pts) The following points pertain to the introductory lectures on evaluation. Which statements are true? (none, one, or many).

(A) Data are considered inconsistent with models because all models are false.

(B) If data are consistent with a model, it means that they support it.

(D) Classifying data as irrelevant to a model means that the data could not possibly refute it no matter how they turned out.

(E) Classifying data as irrelevant to a model means that the data were not gathered according to the Ideal Data template

(F) Data gathered on the incidence of business failures in Austin would likely be irrelevant to models about the origins of the AIDS virus.

(G) In considering guilt versus innocence of a suspect in a crime, the law specifies that we adopt the view that the suspect is considered innocent until proven guilty (consider this statement true). The model of innocence is an example of a null model because it is the model we accept until evidence forces us to reject it.

(H) In science and in many aspects of society, our willingness to reject a null model increases as evidence accumulates against that model. For example, in class, our acceptance of the safety of a vaccine (against the null model that the vaccine was not safe) increased as the sample of successful trials increased from 10 individuals to millions of individuals.

(I) In a Pepsi versus Coca-Cola taste test, one would use either null model that Pepsi tastes best or that Coca-Cola tastes best.

27.(5 pts) Which of the following describe non-zero correlations?

(A) on a per-car basis, red cars are involved in more accidents than cars of other colors

(B) people who drink approximately 1 oz of alcohol per day have higher survival rates than non-drinkers

(D) mice carry diseases that can infect humans

(E) The people you encounter on Mondays tend to be the same as those you encounter on Wednesdays and Fridays, but different from those you encounter on Tuesdays and Thursdays

(F) The average score on exam 2 is the same as the average on exam 1

28. (5 pts) Which points about correlations are true?

(A) Correlations are consistent with many abstract models

(B) a zero correlation means that there is only one variable

(D) advertisements commonly use correlations in attempting to get people to buy a product

(E) if we know there is a correlation between variables X and Y (height and weight), it means that we can predict the average value of Y once we know the value of X

(F) if we know there is a correlation between height and weight of people, it means that we know a specific person’s weight, we also know that person’s height.

(G) It is not possible to determine whether a correlation exists when one of the variables is “country of residence,” because a value cannot be assigned to this kind of variable.

29. (2 pts). Key code. Fill in (A) and (B) on the scantron field for question 29 to indicate your key code.

Radiation

Science and the Law

Ideal Data

Blood type

Drug Testing

Evaluation and Interpretation