UNITED STATES DEPARTMENT OF AGRICULTURE

 

 

 

 

 

 

 

 

 

FOOD AND DRUG ADMINISTRATION

 

FOOD ADVISORY COMMITTEE

 

 

 

MEETING ON INFANT FORMULAS

 

 

 

 

 

 

 

 

 

 

 

 

 

Monday, November 18, 2002

 

 

8:20 a.m.

 

 

 

 

 

 

 

 

U.S. Department of Agriculture

Animal and Plant Health Inspection Service Building

4700 River Road

Riverdale, Maryland


Temporary Voting Members Present

 

James Anderson, Ph.D.

Robert D. Baker, M.D., Ph.D.

Margaret E. Briley, Ph.D., R.D., L.D.

Scott Denne, M.D.

Cutberto Garza, M.D., Ph.D., Chairman

James E. Heubi, M.D.

Laurie J. Moyer-Mileur, Ph.D., R.D., C.D.

Virginia A. Stallings, M.D.

Patti Thureen, M.D.

 

Participating Food Advisory Committee

Members Present

 

Annette Dickinson, Ph.D.

Goulda Angella Downer, Ph.D.

Lawrence N. Kuzminski, Ph.D.

Madeleine J. Sigman-Grant, Ph.D.

 

Acting Industry Representative

 

Roger A. Clemens, Dr.P.H. CNS FACN


FOOD ADVISORY COMMITTEE ON INFANT FORMULAS

 

AGENDA

 

Welcome, Introduction and Charges

  Christine J. Taylor, FDA    4

 

Administrative Issues -- Cathy DeRoever,

  Mary Ann Killian and Jeanne Latham, FDA    7

 

Regulatory Background

  Christine J. Taylor, FDA    23

 

Remarks by Chairperson, Cutberto Garza    32

 

Presentations by Invited Speakers

  Physical Growth Measurements

    W. Cameron Chumlea    43

 

  Body Composition, Kenneth J. Ellis    72

 

  World Health Organization Growth

    Reference, Edward A. Frongillo    103

 

  National Center for Health Statistics/

    Centers for Disease Control and

    Prevention Growth Charts, Lawrence M.

    Grummer-Strawn    127

 

  Iowa and Iowa/Fels Growth Data

    Samuel J. Fomon    152

 

  Growth Data for Preterm Infants

    Jon Tyson    169

 

  Analytical Issues, Edward A. Frongillo    202

 

Questions to Previous Speakers

 

  Clinical Studies -- Product Composition

  Considerations, Duane Benton    228

 

  Clinical Studies -- Clinical

    Consideration, Dennis M. Bier    259

 

Questions to Speakers    283

Preliminary Discussion of Issues by Committee    313

Concluding Remarks    383


P R O C E E D I N G S

    DR. TAYLOR:  I'm Christine Taylor, and I'm director of Office of Nutritional Products Labeling and Dietary Supplements at FDA's Center for Food Safety.

    We will be this morning going through a series of conversations, and what you've got right now is our overview.  So given the fact that we started just a few minutes late, we'll go ahead and try to shortcut this overview.

    In the next few minutes, we'll do a brief overview for this Food Advisory Committee meeting.  We'll review a couple of the administrative issues, which will be focused on ethics and conflicts of interest.  We'll spend some time on regulatory context for this meeting, and then we will begin the meeting per se with Dr. Bert Garza serving as chair.

    During the meeting, there will be a series of presentations and white papers, which we have provided as background information for the committee.  There will be some discussion, and then beginning tomorrow there will be public comments, more discussion, and response.

    We're going to spend just a very few minutes this morning going over the role and expertise of the task force members, and I think the key point to be recognized is that there are several different kinds of members sitting with us today.

    FDA considers a number of factors in selecting individuals to serve on the Food Advisory Committee, including their scientific expertise, as well as issues related to conflict of interest.

    We also have sitting with us a consumer representative.  This person is a voting member of the Committee and represents the consumer perspective on issues and actions that come before the Committee.

    We also have an industry representative sitting with the committee.  This is a nonvoting member, and they're responsible for representing all members of the industry, and not any particular association, company or product.

    Basically, the kinds of members we have on this committee are temporary voting members, as well as some members of our larger Food Advisory Committee.  As I mentioned, we also have consumer reps and industry reps.

    What I'd like to do, at this point, starting with Dr. Baker, if you will, Dr. Baker, just so we can get used to using the microphones, if you would let us know your name, where you are, and if you're too modest, I have a listing of your expertise.

    DR. BAKER:  Robert Baker from Buffalo, New York.  I'm a pediatric gastroenterologist, and I have a Ph.D. in biochemistry and in nutrition.

    DR. STALLINGS:  I'm Virginia Stallings, from Children's Hospital in Philadelphia.  I'm the head of the Nutrition Section there, and I do work in healthy children and children with chronic disease related to nutrition.

    DR. HEUBI:  I'm Jim Heubi.  I'm a pediatric gastrologist, as well.  I'm the program director for the GCRC, the General Clinical Research Center at the Children's Hospital in Cincinnati, and I have a longstanding interest in nutrition relating to infant nutrition bone disease, cholesterol, metabolism, you name it, there's a variety of things.

    DR. ANDERSON:  I'm Jim Anderson.  I'm at the University of Nebraska Medical Center in Omaha, Nebraska.  I'm chairman of the Department of Preventive and Societal Medicine, and I'm a biostatistician by training.

    DR. DOWNER:  I'm Goulda Downer, a doctorate in Human Nutrition, with a residency in pediatrics at Georgetown.  Currently, I'm a clinical nutritionist with my own practice, and I'm also on faculty at George Washington University.

    MS. SIGMAN-GRANT:  I'm Madeleine Sigman-Grant.  I'm a maternal and child nutrition specialist at the University of Nevada Cooperative Extension.

    DR. MOYER-MILEUR:  I'm Laurie Moyer-Mileur, from the University of Utah.  I'm a registered dietician with a doctorate in exercise physiology, and I have over 20 years of neonatal nutrition experience.

    DR. GARZA:  I'm Bert Garza.  I'm a professor of nutrition at Cornell University.  I'm both an M.D. and have a Ph.D. in nutritional biochemistry and metabolism, and my primary interests have been in maternal-child health, with interests in growth, and protein and energy metabolism.

    DR. KUZMINSKI:  I'm Larry Kuzminski.  I'm from Duxbury, Massachusetts.  I'm retired from the food processing industry, having R&D responsibilities and operations responsibilities with the Kellogg Company and with Ocean Spray Cranberries.

    DR. DENNE:  I'm Scott Denne.  I'm from Indiana University.  I'm a pediatric neonatologist.  I have a longstanding interest in neonatal nutrition, specifically, and protein and energy metabolism.

    DR. THUREEN:  I'm Patti Thureen, a neonatologist from the University of Colorado in Denver, and my particular interest is in protein and energy metabolism in the extremely low-birth-weight neonate.

    DR. BRILEY:  I'm Margaret Briley from the University of Texas at Austin, and my expertise has been in nutrition of children and child care.

    DR. TAYLOR:  If we could just stop right there.  Margaret is our consumer rep, and on our right we have Dr. Roger Clemens, who is substituting for Annette Dickinson, who is our industry rep.

    I'll go to the next slide and just give a minute or two about the staff you have sitting at the table with you.  As I've mentioned, I'm with the Office of Nutritional Products Labeling and Dietary Supplements.

    We also have Dr. Susan Walker, who is our associate director for Clinical Affairs, as well as Dr. Beth Yetley, who's the lead scientist for nutrition.

    Jeanne Latham, who is sitting next to Dr. Bert Garza, is our executive secretary, and we're being joined today by Ms. Mary Ann Killian, who is program integrity adviser at the Office of Human Resources at FDA.

    Let me just spend a very quick minute, and then we will return with a regulatory context.  I think in terms of mechanics, we need to understand kind of where we are in the process.  Currently, we are operating as an ad hoc task force to the Food Advisory Committee.  In the very near future, we will constitute an Infant Formula Subcommittee of the Food Advisory, but currently we are still in the ad hoc mode.

    The current focus of the Infant Formula Advisory Meetings is to obtain scientific input for evaluating whether new infant formula supports normal physical growth of infants.  This comes under Section 412 of the Food, Drug, and Cosmetic Act, which in a few moments we'll come back to in more detail.  We're looking basically for scientific input, which eventually will inform the Agency relative to regulatory efforts.

    What we're undergoing currently is a series of meetings, and I'm sure most of you remember that last April we held our first meeting on this issue of normal physical growth.  It included a somewhat general discussion, an effort to understand the regulatory context, as well as a few specific questions about extrapolation and attrition in the study.

    This is the second of this series, and the general scientific topics for today fall into three categories: Growth Measures and Methodologies, the Role of Such Measures and Methodologies in  Demonstrating Normal Physical Growth, and then, finally, Principles and Criteria to Determine the Need for a Clinical Study to Provide the Agency an Assurance of Normal Physical Growth.

    It's always helpful to be clear about what's not on the table.  There are so many issues in the area of infant formula, normal physical growth, other issues related to the Agency's regulatory purview that sometimes it's important to realize there are things that are of great interest, but are not on the table for discussion.

    This lists a few, probably the ones that our discussions will most likely tend to gear toward.  The design and conduct of studies is not on the agenda today, other endpoints of clinical studies is not on the agenda.  What constitutes major and minor changes is not on the agenda.  That's, of course, for those of you that are intimately involved in the regulatory, you understand that that has regulatory meaning.  The nutritional impact or efficacy of formulas, the safety of individual ingredients and specific regulatory decisions are not topics for today.

    We have provided specific background for the committee in the form of white papers.  We have a total of nine white paper which, as Dr. Garza will explain in a few moments, we'll go through this morning.  Each of the papers will be introduced by an expert, and then of course discussed by the committee as appropriate.  Those related to the assessment of normal physical growth are listed here, and then for our second topic, changes warranting a clinical study, we have two white papers.  Those should be in your notebooks and available for further discussion.

    Now, the Agency's role is to give you specific charges that are to be accomplished by the end of the meeting on Tuesday, and those charges are in your notebook in the form of seven questions, and I won't go through them now.  I think Dr. Garza will take the time to do that with you later on, but they fall into basically four categories: Metrics for evaluation of growth, which is Questions 1, 2, 3A and 3B; questions about comparators, Questions 4 and 5; controlled feeding parameters, Question 6; and then changes in composition, Question 7.

    Just for the group of us here, the summary of the charges fall into two categories: The criteria for adequate evaluation of normal physical growth during the first six months, and here are several substantive ones.  Again, they are specifically it in your questions; and then, secondly, the type of changes in infant formula that should warrant a clinical study.  Again, those are the remaining questions in your notebook.

    Just in terms of the mechanics, this morning, next, we'll cover the administrative issues, Jeanne Latham, as assisted by Mary Ann Killian, will go through that with you.

    I'll return, and with the help of Dr. Walker and Dr. Yetley, give you some regulatory context and then the actual task force meeting will begin.

    What we'll do is hold questions until after the administrative component, and then again after the regulatory context, and then we should be on our way.

    So, Jeanne, I'll turn the meeting over to you, and dutifully return for the next part.   Thank you.

    MS. LATHAM:  Good morning.  I'm Jeanne Latham and, first of all, in terms of administrative issues, we wanted to have Cathy DeRoever's statement read into the record, and Dr. Garza will take care of that.

    Thank you.

    DR. GARZA:  Catherine DeRoever, the executive secretary of the Food Advisory Committee, was asked to take a few minutes to refresh everyone's memory about a few of the rules of the road, in terms of Advisory Committee operations, so I'm going to be reading her statement.

    It is my understanding that all committee members have been provided with a copy of a Committee Member Guide to FDA Advisory Committees and a video.  The video's title is "A Panel Member's Responsibility."  I believe there are copies of the Member Guide available at the registration desk for anyone who may be interested.  The Committee Member Guide is in need of updating but, by and large, it provides a good operational overview.

    FDA relies on its Advisory Committees to provide the best-possible scientific advice available to assist us in making complex decisions.  Our goal is to do this in as open and transparent a manner as possible.  Part of that openness carries with it a request that the members try to avoid even the appearance that issues are being decided or conclusions are being reached outside the actual meeting.

    We understand that issues raised during the meeting may well lead to conversations over breaks or during the meal.  In fact, we hope the discussions are thought-provoking.  We have had instances where the members have come back from a break and said, "You know, we were talking over break, and we would like to request that FDA provide us some additional information so we can better understand thus and such."  This is perfectly acceptable.

    What we don't want is to have a situation where after the break the members come back and say, "We were talking over break, and we decided that the answer to Question 1 is..."  From our perspective, that would be particularly troublesome because neither the Agency, nor the public, would have had the benefit of listening to the entire discussion, the questions raised, the responses, et cetera.

    In fact, FDA has recently adopted a policy that only matters that can be decided by a show of hands are procedure matters, for example, break times.  I'm not sure I understand that.

    [Laughter.]

    DR. GARZA:  All other votes and comments must be placed on the record, attributed to the member making the statement.  The policy goes even further.  If a member has to leave the meeting early, that member waives the right to vote.  You may wonder why would the person lose their right to vote, but the answer is fairly simple.  FDA believes all parts of the meeting and the discussions are important.  Consequently, voting on issues without having the benefit of all of the discussion would be premature.

    The issue of openness is larger than what transpires during the course of the meeting.  I would like to call your attention to the section in the Members' Guide, titled, "Member Interaction Before, During and After a Meeting."  In essence, this section underscores the fact that all communication with the members should be routed through the Committee's executive secretary.  No one, not even FDA staff, with the exception of the executive secretary, should be contacting the members about upcoming meetings, topics, et cetera.

    This same guidance applies to consultations between members prior to a meeting.  If a member receives an inappropriate contact, the members should feel free to notify the executive secretary and/or refer the person making the contact to the executive secretary.  Our goal in having all contacts routed through the executive secretary is to minimize any situation that could be misinterpreted.

    Appearance issues are always difficult because, as is true of many things, appearances can be deceiving.  We ask that our members, guest speakers, and everyone attending the meeting be mindful of how an interaction between a member and anyone, for that matter, might be perceived.

    Please let me be clear it is not my intention to question anyone's motives or integrity, but I am very sensitive to the issue because I have, and imagine so have you, seen highly respected individuals become the object of negative attention based on a misperception, and I certainly wouldn't want anyone in this room to become such a target.

    I am confident that everyone here is sensitive to these issues and can appreciate that my comments are intended as a gentle reminder.

    Thank you.

    Any questions?  Which I will refer to Ms. Latham.

    [Laughter.]

    DR. GARZA:  From any of the committee members?  Is all of that clear?

    [No response.]

    DR. GARZA:  Thank you.

    MS. LATHAM:  Good morning.  I am Jeanne Latham, the executive secretary for the FDA's Food Advisory Committee on Infant Formula.  I want to welcome everyone, and I'd like to read the conflict of interest statement for the record.

    The following announcement addresses the issue of conflict of interest with respect to this meeting and is made a part of the record to preclude even the appearance of such at this meeting.

    By the authority granted under the Food Advisory Committee Charter of July 2002, the following individuals have been appointed as temporary voting members by Joseph A. Levitt, director, Center for Food Safety and Applied Nutrition:

    James Anderson, Ph.D; Margaret Briley, Ph.D.; Robert Baker, M.D., Ph.D.; Scott Denne, M.D.; Cutberto Garza, M.D., Ph.D.; James Heubi, M.D.; Laurie Moyer-Mileur, Ph.D.; Virginia Stallings, M.D.; Patti Thureen M.D.

    The issues to be discussed at this meeting are issues of broad applicability.  Unlike issues in which a particular sponsor's product is discussed, the matters at issue do not have a unique impact on any particular product or manufacturer, but rather may have widespread implications with respect to all infant formulas and their manufacturers.

    To determine if any conflicts of interest exist, the committee participants have been screened for interest in companies that make infant formula.  As a result of this review, in accordance with 18 United States Code, Section 208(b)(3), Dr. Cutberto Garza has been granted a particular matter of general applicability waiver that permits him to participate fully in the matters at issue.  A copy of the waiver statement may be obtained by submitting a written request to the Agency's Freedom of Information Office, Room 12A30 of the Parklawn Building.

    With respect to FDA's invited guest speakers, there are reported interests that we believe should be made public to allow the participants to objectively evaluate their comments.

    Dr. W. Cameron Chumlea has a grant from Nestle to serve as a coordinating center for a nutritional study of Chinese elderly.

    Dr. Samuel Fomon previously consulted with firms that make infant formula and is likely to do so in the future.

    Dr. Duane Benton owns stock in Abbott Laboratories, and he receives retirement benefits from Abbott.

    Dr. Dennis Bier's employer, the ARS Children's Nutritional Research Center, recently received the Bristol-Myers Squibb-Mead Johnson nutritional 2002 unrestricted nutritional research grant.  As Center director, Dr. Bier is named as the principal investigator, although no funds come to him personally or for his personal research.

    We would also like to note for the record that Dr. Roger Clemens is participating in this meeting as the acting industry representative and a nonvoting member of the Committee.

    In the event that the discussions involve any other issues not already on the agenda, for which FDA participants have a financial interest, the participant's involvement and their exclusion will be noted for the record.

    With respect to all other participants, we ask, in the interest of fairness, that they address any current or previous financial involvement with any firm that makes infant formula.

    Thank you.

    With that, I will turn the program back over to Dr. Taylor.

    DR. TAYLOR:  Thank you very much, Jeanne.

    Our goal for the next 15 or 20 minutes is to set the regulatory context for the discussions we're having today.  For those of you that remember the spring meeting, we did spend some time on that, and hopefully most of this is a review, and all we have to add is an additional focus relative to the topic for today.  As I've mentioned earlier, this is an ad hoc task force of the Food Advisory Committee, addressing infant formula issues.

    Obviously, we have statutory authority relative to infant formula, and its long history goes back to 1980, at which time Congress passed special legislation that amended the Food, Drug, and Cosmetic Act.  We, in the Agency, try to avoid throwing numbers, and clauses and phrases around, but it's almost impossible not to, and the key phrase is that it provided Section 412 to the Food, Drug and Cosmetic Act.

    In 1986, Congress had an interest in adding to this, providing more statutory authority, and so there were some additional amendments in 1986.

    I think what we have to keep in mind is that the infant formula legislation happened for a very specific reason.  Infant formula is unique from other foods.  It is the sole source of nutrition for a vulnerable population.  In Congress's mind, it therefore warranted a special set of provisions for regulation.

    It's clear that the intent and outcome of this action was the following statement from Congress.  It should not only be safe, which I would point out is handled in separate sets of provisions than what we're addressing today and contain all of the necessary nutrients, which again is the separate set of provisions which are not on the table for today, but also should provide those nutrients in a bio-available form to ensure that the infant formula were to support optimal infant growth and health.  That's what we're about today in some respects.

    This chart is a little complicated at first, but I think it sets the context for what we call to be regulatory boxes, and really what's on the table today is this particular component, but all of this is the regulation of infant formula.  The safety of the individual ingredients, the classic safety considerations are handled under a separate set of provisions, the so-called Section 409.  So ingredients for intended use, that's where most of your classic safety reviews come in.

    Section 412, as provided for by Congress in 1980, is really a statutory check on a particular formulated product.  In providing those assurances, companies consider the required nutrients that have to be in the formula, the good manufacturing practices or GNPs and quality controls, and then quality factors.

    Today, we are focusing on quality factors.  These assurances are provided prior to marketing, and once marketing occurs, in the world of infant formula, the claims then come in, as far as efficacy, truthful, and not misleading, again, a separate set of provisions.  I'll come back to this in a moment, but the key component is that we're here taking a look at that.

    Now, as mentioned just a second ago, in order to provide these assurances, manufacturers submit a notification to FDA 90 days prior to their intention to market that particular infant formula.  Again, as I mentioned, it's specific to a finished product.  The Agency reviews it, again, as I just mentioned, for those three components, and it's here, hopefully, highlighted in red that our questions today will focus.

    The definition of quality factors is not precise.  They certainly do offer the opportunity of expanding, as needed.  There is language from a 1980 discussion in the House Committee, and their references to quality factors focus on things such as pertain to the bioavailability of a nutrient and the maintenance of levels or potency.  They discuss at great length the growth of infants during the first few months of life, and they discuss the concept of healthy growth, the idea being that once you've formulated a product, it needs to support healthy growth.  So, in its simplest form, quality factors are a check on the concern that once you get the entire product put together it works appropriately.

    Now the types of quality factors could be many.  At this point, we basically have two.  In the realm of nutrient-specific, we have provisions for protein efficiency ratios, protein per se, but over time, others could be put in place.  In the world of the formulation itself, the totality of the formulation the quality factor we address is normal physical growth, and, again, others could be put in place over time as needed.

    So, for today, normal physical growth as  quality factor is what's on the table.

    The scientific questions that will come through as you read the charges are basically twofold.  How do you measure and affirm normal physical growth and how and when should assurances of normal physical growth be appropriately provided?

    Going back to that again, this particular slide, quality factors, normal physical growth, assurances for a specific product, along with other components of these assurances.

    Now, just so that we're sure how it works from a regulatory perspective, we've put in this slide, but I think it's redundant to what we've said before.  In order to provide assurances, vis-a-vis Section 412, the manufacturer submits a notification 90 days prior to marketing.  FDA reviews the notification package, taking into all of the components, nutrients, GNPs, quality control and quality factors, and if assurances are adequately provided, FDA does not object to the marketing of the formula.

    If, in the Agency's opinion, assurances are not adequately provided, FDA does let the company know that it objects to the marketing of this particular formulation.  It's important to note, from a regulatory perspective, that this is not a premarket approval process, so manufacturers do have the right to go to market over FDA's objections.

    The scientific input we get from you folks today and tomorrow will certainly guide our thinking about the evaluation of normal physical growth when infants are fed a new formula.  We'd like to point out that it certainly helps us, but it's also helpful to stakeholders in that what is expected becomes clear to them.  It's not as much of a black box if it's quite clear how FDA's scientific considerations are handled.

    It's also going to guide our thinking about when clinical studies should accompany formulation of processing changes in infant formulas, and again it's helpful to us, but it's also helpful to our stakeholders.

    The outcome of today's discussions can be used to inform our ongoing reviews, but we do need to talk a little bit about current rulemaking, in that discussions today have the opportunity or the possibility of impacting on current rulemaking activities.  The current state of our rulemaking, as probably many of you know, is that in 1996, we proposed a rule to implement parts of Section 412, and in that was included the implementation of quality factors.

    That final rule has not been issued, so we are still in the process of what's known as rulemaking.  If input from this Committee is relevant, and it may or may not be, but if input from this Committee is relevant, there would need to be an opportunity to comment on that, and we would, of course, reopen the comment period on this rule for that purpose.  So we retain the option of reopening the comment period.

    So, again, just to review, you've seen this before, today's discussions, vis-a-vis the charges, our growth measures and methodologies, the role of such measures relative to normal physical growth, and the general principles and criteria to determine the need for a clinical study to provide assurances of normal physical growth.

    I think we've gone over topics not under discussion, so I'll mention these only in passing, and then again remind you that the specific charges from the Agency to the Committee are the seven questions in your notebook, and I'm sure Dr. Garza will go over those with you in some detail.

    I do want to introduce Dr. Susan Walker, who's at the table here, our associate director for Clinical Affairs, and Dr. Beth Yetley, who's our lead scientist, and the agreement we have is if you have questions, I will go join them, and we will answer them as a troika.

    Thank you very much.

    MS. LATHAM:  Are there questions?

    DR. GARZA:  Are there any questions to Dr. Taylor?

    [No response.]

    DR. GARZA:  Thank you very much.  That is clear, judging from the lack of questions from the Committee.  I do want to take this opportunity to welcome the Committee members, and guests, and staff that have joined us, and to thank Dr. Taylor because we've made up the lost time.  I was concerned that we would be running late, but we're doing all right in terms of time.

    We have a very full agenda and would like to begin by asking the Committee members if, in fact, they have any questions about the agenda.

    [No response.]

    DR. GARZA:  Very quickly, just to review the procedure, we're going to be launching into one of the major segments of the Committee meeting in just a few minutes, and that is nine presentations, based on the nine background papers which were sent to each of us several weeks ago.  Only Committee members have the privilege of asking questions to any of these presenters.  We will try to hold presentations to about 15 minutes.  Someone will be helpful in alerting the speakers when I think there is about three minutes left in their presentations to help them allocate their time appropriately, and then we will have about 10 minutes of questions from Committee members to each of their presenters.

    You also will have the opportunity to ask questions of those that make comments in the public comment period tomorrow, and we will have blocks of time then to come to some consensus on the seven questions that you have been sent as well.

    It's going to be very important that we address each of those questions carefully, and therefore I'd like to make sure that each of you has a chance to review those questions because I will be proposing time limits to assure ourselves sufficient time to deal with each of them in a way that doesn't shortchange any of them, and so we'll be trying to deal with that time allocation later this afternoon.

    All experts, I am told, will be able to stay throughout today and tomorrow for those questions and answers to that if, in fact, in those blocks of time when we're dealing with any of those seven questions, any of the committee members would like to address any questions to any of the presenters, and then obviously that's going to be possible as well.

    You have seven questions.  They have been divided for us in four sections.  One of those sections is on metrics for the evaluation of normal physical growth, a second section deals with comparators for the evaluation of normal physical growth, and a third is on controlled feeding comparators, and the fourth is on changes in infant formula composition.

    Rather than reading each of the seven questions, I'd like to take just a few minutes to ask committee members if you have any questions about the issues that we've been asked to consider under each of these sections so that, in fact, we can be clear what we're being asked to do, and you can have those clearly in mind during the presentations.

    So let's begin with the first in terms of metrics for the evaluation of normal physical growth.  There are two questions under that section.  Do any of the committee members have any questions about points that you're being asked to address?  I'll give you a few minutes to review those, and we can ask the troika to clarify those for us.  I think that's the way you were described.  That was not my word.

    Having had the pleasure to work with them, you'll get informative responses to your questions, I'm sure.

    MS. LATHAM:  At the end of last week--this is Jeanne Latham, the exec sec--we e-mailed to everyone the updated questions, and I just wanted to make sure that you all have those, and if you don't, we will get them to you.  You've got them. . Anybody that doesn't have them?

    [No response.]

    DR. GARZA:  I think they were in the packets today, again, in case you didn't bring them with you.

    Would any member of the staff want to address any questions in this section?

    [No response.]

    DR. GARZA:  I take it, then, that they're clear--Dr. Thureen?

    DR. THUREEN:  Yes, I have one question.  These are metrics for the evaluation of growth between birth and six months of age.  We will be dealing with both term and preterm infants, I presume.  Should they be handled separately?  Because the preterm infants we are discussing I believe growth after post-conceptional age birth to six months, so should they be handled separately or should we just do a general assessment of these evaluations, presumably for term infants, with maybe later adjustments for preterm infants?

    DR. GARZA:  No, I would assume that we will take those separately, but let me ask the staff if they would object, if there's any reason why we shouldn't take them separately?

    Committee members?  So we'll probably do A and B.  Thank you.  That's a good clarification, with preterm being all preterms, low birth weight, very low birth weight, and extreme low birth weight.

    Any other questions, then, on this first section?

    [No response.]

    DR. GARZA:  Then, on the second, on comparators for the evaluation of normal physical growth, let me give you a few minutes to review those two questions and see if there are any issues that need to be clarified.

    [Pause.]

    DR. GARZA:  Any questions on either of those?

    We have one question on the control  feeding comparators.  Let's take a look at that and see if that's clear.

    DR. ANDERSON:  This is Jim Anderson.

    I wonder if I could get a clarification of the difference between the current infant formula plus new ingredient that's listed on the first bullet and the infant formula plus new ingredient with the asterisks on the last of the bullets.

    DR. GARZA:  On the last bullet, right?  So it's listed below are examples of controlled feeding clinical comparators, and I believe the question is can you clarify the distinction between the first and the last bullets.

    DR. TAYLOR:  We're working on it.

    DR. GARZA:  I gather what it meant was that it was a study in which the new ingredient would be used with some infant formula, but that the intention was to market the new ingredient as a component of some other infant formula.

    DR. GARZA:  That was mine or a generic infant formula, where that new ingredient might be added to any formula, so that it would be a generic comparison was the way I read that.  Am I not clear of that?  If my interpretation is correct, so it's a generic--

    DR. WALKER:  Your interpretation is correct.

    DR. GARZA:  So it's a generic endorsement of the ingredient.

    DR. WALKER:  Right.

    DR. GARZA:  Does that clarify it?

    Dr. Walker, would you--

    DR. WALKER:  The instance in which the generic ingredient is added to the new infant formula and the way that you discussed there, Dr. Garza, is a correct interpretation.

    DR. GARZA:  So it's more an endorsement of the ingredient itself, with a test formulation, but as opposed to a specific formula that had been marketed in the past, where a new ingredient would be added.  I understood it to mean a more generalized evaluation, rather than a specific one.

    DR. WALKER:  I think after we have some of the discussions, some specifics of these will be made much more clear.  I think the speakers will address a lot of these issues in detail, and then we can have more questions.

    DR. GARZA:  And I've been reminded that we must each identify ourselves before we ask questions.  I will try to remember, but I'm probably the guiltiest of all.  I would hope that they would recognize my voice before this meeting is over.

    The statement I was supposed to read said, "I am Cathy DeRoever."  I thought I better not say that.

    [Laughter.]

    DR. GARZA:  It would confuse people, and obviously embarrass Dr. DeRoever as well.

    Are there any other questions on this third section?

    [No response.]

    DR. GARZA:  If not, then, the fourth one is the last.  It is also one question with two parts, A and B.  The table that is attached, obviously, is quite informative, so I would ask you to take a few minutes to look at Question 7, along with Table 1, see if there are any questions.

    [Pause.]

    DR. GARZA:  Are there any questions related to this section?

    [No response.]

    DR. GARZA:  If not, what I propose is the following; that you think about 30 minutes for each of the first six questions, approximately 120 minutes for the seventh question.  If we don't need the entire two hours for the last question, we can always come back and address issues that perhaps we might have felt were not completely resolved.

    Obviously, if it's clear that we need more time with any specific question, as the discussions evolve, then we can always go back and try to reallocate them, but we will come to some agreement on how much time to spend with each before we start the discussion this afternoon.  But as you've had a chance to review them, see if, in fact, you feel comfortable beginning with that type of allocation for the various questions that we're being asked to review.

    I don't think that all first six would necessarily take 30.  Some may take a bit longer, and some will take less, but I want to make sure that we don't shortchange, as I said, any specific question, and so we can get agreement on that as a group, going through the discussion and developing some information that would be useful to the FDA I think is more likely.

    Are there any other questions regarding procedures or the charge to the group that either staff or I have failed to clarify for you?

    [Pause.]

    DR. GARZA:  If not, then why don't we begin with the presentations.  I know that Dr. Chumlea was on the bus, so I assume he's here.  We're starting a bit early, but I think that's fine.  I will just introduce each speaker as they come forward.

    Dr. Cameron Chumlea is a Fels Professor at Wright State University School of Medicine in Ohio.  And for those of you in the field of anthropometry, I don't think Cameron needs any introduction.  For those that may not be familiar with his work, we don't have time.

    [Laughter.]

    DR. GARZA:  It has been quite extensive, and he's certainly recognized throughout the world for his work in this area.

    Thank you very much for the white paper and for joining us this morning, Cameron.

    DR. CHUMLEA:  Thank you very much for the introduction.  It's a pleasure to be here this morning, and I hope I can provide the committee with some information that's appropriate.

    First, I'd like to just simply recognize my co-author, Dr. Shumei Sun, who I know is familiar to many of you and point out that she has just recently become Wright State's Brage Golding Distinguished Professor of Research.

    So, first of all, I'd just like to point out that growth is relative, as you can see from the slide here, it says, "I keep track of my son's growth, which is going up the vertical scale, and my husband's growth.  Frank is age 30, 33, 35, 40."  So growth goes in various directions for all of us here.

    The second thing here is that we're dealing with infants.  Of course, there's our perspective of what infants are, but there's also a public perspective, and I saw this checking out the groceries and decided it really deserved a slide because clearly this, to some degree, is maybe the public's viewpoint, which you can clearly see.  It's amazing what you can do with PowerPoint these days and some slides that are available.

    So what I'm going to do this morning is just basically cover some brief information that's probably familiar to everybody, so that we can just kind of all come up to speed.  Clearly, infancy is a period of rapid growth, and to some extent, this is probably the most difficult group of individuals to measure.  The only other group that's equally difficult is at the opposite end of the age range, but this is generally a very difficult group, but it's also easy in the sense that there's also very few actual measurements that can be collected from them that are really going to be useful.  Weight, recumbent weight and head circumference are the three that are the most important and the ones that should be taken.

    Just to review, weight clearly measures the growth of all body tissues, recumbent length describes the amount of linear growth because we're dealing with both increase in mass and increase in size, and then head circumference reflects brain growth because this is the period of time, the first few years of life, when the brain actually does the majority of its growth.

    This is a period of time when body dimensions increase at a greater rate than in any other period in life.  Weight increases between birth and six months about 115 percent, length increases about 34 percent, head circumference increases 22 percent on average.  The rate of growth in weight ranges from about 1.1/1.2 kilos for boys or girls at one month of age, but then we have to remember that we're on a growth curve here, and then it, of course, starts to slow down, and by six months it's running around a half a kilo a month for boys and girls at six months of age.

    The rate of growth in length is about 3.5 to 4 centimeters per month for boys and girls, and it slows to about 1.5 to 2 centimeters a month for girls and boys at six months.  Just as a reference, the adolescent growth spurt between, say, you know, 12 to 16 years of age, the maximum amount there is only somewhere in the neighborhood of 5 to 8 centimeters a year.  So here we're looking at 3 to 4 centimeters a month.  So they can just put that in comparison because everybody kind of focuses and remembers how much growth their kids did when they were adolescents.  They frequently forget that that rate was a fraction of what they were really doing in the first few years of life.

    The assessment of status, once we've collected measurements, we really need the measurements to be accurate and reliable, and this is really a very critical point, particularly in this particular age range.

    The measurements are really not difficult to take, and there are a variety of mediums in which they're now described.  NCHS produced a video at the end of NHANES III that describes these and all of the measurements that were used in NHANES III on that videotape.  WHO has an in-house video that describes the measurements that are being used in their multi-center growth reference study.  I don't think that one has really actually been distributed yet, but it is available if you can talk to the right people.

    These are also all very similar techniques.  They are also probably being what's currently being done in the current NHANES, and they're all the ones that have come out of the Anthropometric Standardization Reference Manual from 1987, by Lohman.

    When we collect the measurements, this is a point where frequently things get skipped over, and the reason things get skipped over is because there's lack of time, money, personnel, et cetera, but it's the part that's really very important, in terms of collecting the information because, one, we're going to either plot those on a growth chart or refer to status, and the other aspect is what's very important here is we're going to calculate increments of rate of growth, and there you're compounding your measurement errors.

    It really takes two people to measure an infant appropriately.  I'm sorry, folks, but that's really the correct way to do it.  It can be done with one, but that's going to add to the errors that are going to be involved.  You need an examiner who's going to position and take the measurements, and you need a recorder who's going to be writing down the measurements because one person is holding the infant and trying to do the measurements, and they really don't have time to write anything down, and then they need to switch roles because we need to take double measurements here so that we can get as much information as possible.

    Also, what we'd like to do is have the technicians compare their values.  One, this just simply catches transposition errors that occur frequently because people write down numbers in different ways sometimes.  All of the studies that I've described to you have allowable differences between what the measurement values can be between the technicians, again, to control for errors and just to control for variabilities that can occur.

    Did I skip one?  How do I go back on this?

    [Pause.]

    DR. CHUMLEA:  Weight.  An infant can be weighed alone or they can be weighed while the mother is holding them.  It depends a little bit on the situation, but, frankly, I would prefer that the mother hold the infant.  You can weigh the mother, weigh the mother holding the infant, take the subtraction and you're going to get the weight.  The reason I like the idea is it keeps the baby calm, and it provides for a very stable piece of information.

    The infant can be weighed alone, and there are a variety of electronic scales, but pretty much once you take the baby out of the mother's arms and you place it on something else, it starts moving around.  Fortunately, the manufacturers of several of the scales now can compensate for this weight so the stability of the measurement is much better than it used to be.

    It's best that they be weighed nude.  Blankets, et cetera, are available.  However, if they are going to be in undergarments, I think NCHS subtracts about a tenth of a kilo from that for the readings, and spring-type scales and beam balance scales are simply not appropriate for use any more.  The electronic scales are much more available.

    There's a company called Seca that makes some very good scales.  We've used those.  WHO has a really nice platform scale that actually pares the mother's weight.  I don't know exactly what the manufacturer is, but it's a really excellent device.  I think it's been specially made for them, but there are a variety of scales that are available for use, but my preference is that the infant should be held with the mother or the caregiver, whoever is there, and then subtract the weight, if that's possible.

    This is the one nobody really likes.  It's recumbent length.  It takes two people to do it, again.  A variety of different pieces of equipment that are useful for doing this.

    It requires one person holding the infant's head.  It requires the mother, the caregiver standing there beside the infant reassuring them that nothing is going to happen, and it takes a third person then to position to footboard up against the soles of the foot of the infant.  You're holding the head so the child is looking straight up in a vertical Frankfort plane, and it takes then another person to hold the legs, both legs, for the infant, if it's very small, and get the length, and by the time they're six months, generally, the best you can sometimes do is grab one leg and try to hold the other one with your little finger and get it.  It's not easy to do, and it's one that can be particularly prone to error, and it's also important that the kid stays straight down the table.  I think I've covered what's there.

    This simply gives you a description.  I think this child is about two, the age there, but again positioning the head up against the headboard, keeping the legs straight and keeping the feet straight up in terms of taking the measurement, but clearly it takes two people to do.

    Head circumference, it should be measured with an inelastic tape of fiberglass, metal, something like that that's good.  It's really best that the infant is seated in the mother's lap.  I don't like it being done with the infant.  I've seen that done.  I like the kid up in the mother's lap, which requires then the person getting, who's taking the measurement, to get down beside the mother.  This allows the mother to cuddle the infant, keep him quiet, and you can slip the tape over their head and get the measurement before they pretty much know what happened to them.

    It's placed right across the front of the skull, and it can be quickly moved up and down the back of the skull.  The insertion tape is a nice piece of equipment that's useful for doing this until you find the greatest circumference, and then you pull the tape tight, and this is something we have to coach people in because it doesn't hurt the infant.  There's no pain involved with it.  They feel a little pressure, and they want to kind of shake it off, but it does need to be tight.

    And you can see here it's just anchored really right over the kid's eyebrows, worked up and down.  Most kids at this age aren't going to have quite this much hair, and so it's generally pretty easy to get this one from them, particularly, again, with the child being comforted by the caregiver.

    Now, what are some other measurements that could be taken?  Well, there's really a bunch of them, but I don't think those are really going to be really appropriate in the instance here.  Crown-rump length is sitting height.  Crown-rump length, I think that was pretty only used in children with special cases.  Chest circumference, limb lengths, one that's potentially possible is skinfold thicknesses.

    The problem with all of these measurements is they really kind of have a restricted utility in terms of describing normal or healthy growth.  They're frequently prone to high measurement errors, and there's really a limited amount of reference data available for all of these measurements pretty much within the age range that we're looking at, six months.

    If you're going to do something like a skinfold.  The skinfolds you're going to take are going to probably be triceps and subscapular.  Now, again, you're bringing two people into the program, if not three, and you've got to go to landmarks, and to do a triceps, you've got to find the midpoint of the arm, which means you've got to measure it, and make that determination, and you can see clearly it's requiring one person is holding the child and the other person is taking the measurement, and then you're going to have to go and take the measurement.

    The question comes up there in terms of equipment.  Skinfolds are dependent upon the type of caliper you're using.  There's two major brands, really.  There's a Lang and a Holtain, which you see here.  NCHS and WHO are both using the Holtain caliper.  I think the Lang is still used out there, to some degree, but I'm not that familiar with it any more.  There are differences there.

    Really, if you're wanting to do skinfolds, the question is what do you want to get out of that?  And probably what you wanted to get is total body fat, and there's probably, I think Dr. Ellis is probably going to talk about better ways of doing that today than taking skinfold measurements.  I don't really feel comfortable in doing it on anybody until they're about two years of age.  It's just difficult to do.

    Now, there's indices that can be used from the information that's collected.  BMI is the one that we all get informed, and just for your information, I'm 28, so you can kind of put that in reference.  I always think everybody should, when we have meetings like this, they should always walk around with their BMI on it, so we'll all be honest about this folks.

    [Laughter.]

    DR. CHUMLEA:  The problem with BMI is that, in infants, you've got 25-percent body length is composed of the head, so that throws off the proportionality aspect.  The relation of BMI with direct measures of body composition in infants hasn't really been established.  Weight for length is probably a better descriptive indices of relative leanness adiposity within children.

    Measurement error is very important and needs to be paid attention to.  The catch here is that the error which may be small is actually going to be very large because of the small size of the child that you're measuring.  So you really need to pay a tremendous amount of attention to error in measuring infants.  Of course, they can have a tremendous impact on the interpretation, if you're going to go growth increments.

    We need to get good-quality equipment.  Measurements should be taken.  If they're taken on a daily basis, the equipment needs to be calibrated.  That includes scales.  People forget that scales can go out of calibration, and then particularly the technicians need to be trained in a standardized way of taking the measurements.

    We need to collect inter- and intra-observer reliability.  Quality control is really important, particularly if there's going to be more than one center used to collect information because we need to control for inter-site differences.

    Measurement schemes.  You need baseline, an interim and a final.  I really like something that's going to be getting a measurement at 1, 2, 4 and about 6 months of age is my preference for collecting things, generally, starting after about 10 to 14 days.  With measurements at 1, 2, 4 and 6 months, you're going to get a good accountability of weight measurements over that period of time.  Clearly, if you can collect more measurements, the more measurements the better.  I'd be very happy to have those.

    I'm going to cover just very briefly growth increments, which are going to be calculated from the repeated measures of growth, and there are charts that these can be plotted on from birth to 12 and 3 to 6 months of age, which are examples here.

    These are from Fels data, and just contrary to popular opinion, the majority of Fels infants were breast fed for at least three months, exclusively, so that has been reported.  We tend to not get a good press on that.  I just want to kind of correct that.

    In terms of growth velocity data, there's the Fels data.  Also, I'd like to point out that WHO is collecting longitudinal data from its multi-center growth reference study, but this data and report from that study has not been available, and that'll probably be given later.

    So recommendations, from what I've just described to you, weights should be measured I think at 1, 2, 4 and 6 months.  I'd like to see recombinant length and head circumference at the beginning and end because it just gives you additional information on the quality of the size of the infant.  Close attention needs to be given to methodology and errors.  Two technicians are really important and reliability data needs to be collected, and use of existing increment charts until the WHO charts are available.

    So thank you very much.  I'd just like to say, personally, the last time I had to give a paper in front of Dr. Briley, she gave me a B--

    [Laughter.]

    DR. CHUMLEA:  --which is about 25 years ago.  So I hope I did at least that good this time.

    Thank you.

    DR. GARZA:  We'll take that up at the break, I guess.

    DR. CHUMLEA:  Okay.

    [Laughter.]

    DR. GARZA:  Thank you very much, Dr. Chumlea.

    Are there any questions?

    DR. SIGMAN-GRANT:  Sigman-Grant.  I have a question.  You recommend weight starting at one month, and you talked about the regain from the loss from birth weight.  So much is happening in that first month.  Why don't you measure it before--

    DR. CHUMLEA:  I said that, really, as early as, say, 10 to 14 days.  I would like to see it done that way, but within no later than one month of age.  So let me kind of restructure that between--

    DR. SIGMAN-GRANT:  Why not between--why not the first week, instead of 14 days?

    DR. CHUMLEA:  Well, there's a shift in weight, as far as I'm familiar with, that's supposed to occur after birth, and so I think, just my understanding is, that there's a period of time within the first week or State or local that the infant basically kind of stabilizes after the birth experience.  Now, Dr. Fomon, I'm sure, could give you more information on that, if I'm incorrect on that.

    The more measurements you can get out of this, the better.  I was being, trying to give you what I think is the very minimum that you have to collect there.  You can measure them every week.  That would be fine with me.

    DR. DOWNER:  Goulda Downer.  I understand, when you talked about examiner variability and possibly downright error, but can you talk a little bit more about why you don't think that subscapular skinfolds and triceps skinfold are important--

    DR. CHUMLEA:  Useful information?

    DR. DOWNER:  Yes, at this juncture, because I think it is.

    DR. CHUMLEA:  The reason I don't like them is this.  First of all, they're extremely difficult to collect in children at this age, so the amount of error that's in the measurement is extremely high.  The question I would have is that what information are you going to get out of this particular measurement that you're not going to be getting by bodyweight alone?  Because if weight is going up, the skinfolds are going to go up; if the weight is going down, the skinfolds are going to go down.

    So the question that you're really interested in is total body fat, and, yes, 90-some-odd percent of total body fat in a child is principally subcutaneous; that if you want to go total body fat, there are now better ways of doing that, such as DXA, that I think are going to give you the information that you really want.

    If you do go and collect the skinfolds, then you're faced with some reference values that are useful.  There's only two that are out there that are fairly good--what's available from NCHS and NHANES III, and then Dr. Fomon's data on skinfolds.  But outside of that, there's really little other reference data that's available.

    So I guess if I'm going to, what I want to know is total body fat, and if I want to measure total fat, I'll go measure total body fat with something that's going to give me I think better, and more accurate, and reliable information about the child than I'd get from the skinfold.

    DR. HEUBI:  Jim Heubi.  I don't want to misunderstand what you're saying, but you're not recommending that people weigh infants in garments and subtracting--

    DR. CHUMLEA:  I'm sorry, what?

    DR. HEUBI:  You're not recommending that people weigh infants in garments and then subtracting one-tenth of a kilogram for the--

    DR. CHUMLEA:  I would prefer they be weighed nude, yes.

    DR. STALLINGS:  Stallings.  To follow up a bit on the other question.  We're beginning to think I think about looking at infants who are growing too slowly as the historical way of the failure to thrive related to this, but the issues of growing too fast are also of concern.

    So, to go back to the question, could we get most of that information with weights and heights, and weights for heights, rather than looking for data related to adiposity, or if we were looking for excess growth, whatever that concept means, what would you recommend, derived values from the anthropometry or DXA?

    DR. CHUMLEA:  I think--we're still sticking between this birth and 6-month range, and I think if you're getting excess growth, you're going to, you may need to take more frequent measurements so that you can plot and get a better description of the curve as what's going on there.  And if you have more information, then you can discriminate between the children who have excess growth and those who don't.  So that's I think something that's important to consider.

    I think you will get everything you want from weight and length.  If you go to DXA with a child at this age, and Ken will address this more I think in his talk, you're going to get, you know, fat, lean and bone, and the fat is probably the most important aspect here that you'd be concerned about for excess growth, but it's going to be described in weight, also.

    So unless you're wanting to tease a tissue out and say, okay, we're really concerned about the increase in fat here, in addition to the increase in weight, then, yeah, then something like DXA I think would be important.

    DR. STALLINGS:  Follow-up.  The velocity, then, would be what we would be looking at, more than just attained weight?

    DR. CHUMLEA:  I think you have to do both of them.

    DR. STALLINGS:  We, historically, are always looking I think at the attained weight.

    DR. CHUMLEA:  Yes, and you'd have to include the velocity in there because the velocity would, these children should, I think, potentially have much higher velocities.

    DR. STALLINGS:  And that might be a way of discriminating between the concept of normal growth and excess growth?

    DR. CHUMLEA:  Yes, right.  You could have children, let's put it this way, who have, say, after, say, three or four months, when their velocity should be declining, these children might not be declining as what the average is, so they're still obtaining a rather high velocity of growth at that point.

    DR. THUREEN:  Thureen.  I would argue that DXA is not a very useful body composition measurement for most studies because it's not that readily available, especially as a field tool, and I think that a lot of people are now starting to do more caliper measurements for assessment of body fat, even in very tiny infants.  And certain people, like Suda Kashyap, have gotten very reliable measurements over time.

    Do you think that the data from the NHANES study on body composition, using anthropometric measurements of body fat, was not useful or do you think it is useful?  And if you want to do large population studies, do you think that there is a future for caliper measurements?

    DR. CHUMLEA:  The DXA thing you can talk to Ken about.  I'll let him address that when he gets up here.

    There's two issues here, whether we're talking about small studies or large studies.  If you're wanting to do large-scale studies, population studies, like NHANES has done with NCHS, then collecting caliper information is going to be what you can do because there can be limits to what you can collect for DXA, particularly because of the issue of radiation exposure, although it's very minimal, even within the current NHANES, where they have DXA machines in all of the trailers, I think the limited age there is Age 8, from what they're collecting, although they technically have the availability of doing it in those particular studies.  Other people clearly don't have access to such expensive pieces of equipment.

    If you don't have access to that, then I'm not opposed to collecting the skinfold data.  The issue comes up that it is extremely difficult to collect accurately and reliably, and so, I guess, I kind of am in favor sometimes of no data is better than bad data, and I know that, in collecting it, it is something that people have to pay very close attention to, the technicians have to be very careful, and this is frequently something that in the course of studies, we pay lip service to it, and there's good attention, but these things do tend to fall out.

    Now, in smaller scale studies, this is something that can be done, and the information can be collected.  Overall, I'm just not that happy with the information.  Yes, there are studies where it's been done very well, and so I'm not putting those studies down at all.  I'm just talking about, in general, my experience has been, in collecting from infants in this age range, that this is really hard to do, and when it's something that's hard to do, it doesn't sometimes always get done the best way.

    DR. THUREEN:  Thureen, one more question.

    In your opinion, if you're looking at a growth outcome study, do you think incremental data are the gold standard--growth data are the gold standard, attained growth are both critical to an outcome study?

    DR. CHUMLEA:  I'm assuming, when you say a "growth study," you're going to be collecting repeated measurements from the same children, so you're going to have both pieces of information available there.

    The status value simply describes where the children are in reference to whatever reference values you're using for peers at that age.  That simply tells you that they're at certain percentile levels, but at the same time children also grow at different rates, and so there's a distribution of the rates at which they grow.

    So children who may appear to be at one percentile level, their rates of growth can be a different percentile level, so it gives a much clearer picture upon what's available.  And since any study where you're going to collect repeated measurements, you're going to have all of that information available to you.  So I would take advantage of it.  Again, the errors are difficult to control and need to be paid attention to for collecting it.

    DR. GARZA:  Cameron, I have two questions.  Given the fact that we are going to be providing advice to the FDA on the approval of specific formulas, how many measures do you recommend be taken if, in fact, one has an interest in the pattern of growth?

    DR. CHUMLEA:  If I was going to design the study, and you're not going to restrict me to what I want, okay.

    DR. GARZA:  From your perspective, if you're going to be protecting the public health and infants' health, what should the American public ask?

    DR. CHUMLEA:  I'd want a birth weight.

    DR. GARZA:  Birth weight.

    DR. CHUMLEA:  Clearly.  I would like it at, say, two weeks, one month--I'd like it again at two, and then at four, five, and six maybe, something like that.  The more measurements I could get out of the thing the better.

    DR. GARZA:  But you think that with seven measurements, one would be able to assess both the pattern of growth, as well as velocity of growth, at those specific time periods.

    DR. CHUMLEA:  Yes.

    DR. GARZA:  And you mentioned there were various sources of error.  Is there any consensus that FDA could rely on that deals with the nature of the equipment, the type of calibration that should be insisted upon, the training that obtaining the measures should be able to obtain and document, and the--those would be the three: equipment, calibration and the training of the technicians.

    DR. CHUMLEA:  There's a little bit of information about equipment errors and a little bit of information about the inter- or intra-observer errors for collection of measurements in the Anthropometric Standardization Reference Manual that's collated in one location.

    There's other pieces of information that are clearly scattered around the literature that are available.  From NCHS, there's really a limited amount of information.  There was really limited error data that was collected in NHANES III.  What's available from NCHS is principally from the earlier NHANES studies, and NHES.  So there's not much there.

    The techniques are described in a variety of locations, but there's not really anything that I know of that's really written down that says, okay, you can refer to here, and this is what you should do, in terms of training, collecting the measurements, et cetera, in one central location, no.

    DR. GARZA:  Any other questions or comments?

    [No response.]

    DR. GARZA:  Thank you very much.

    DR. CHUMLEA:  Thank you very much.

    DR. GARZA:  We'll move on, then, to the next presentation.  It's a topic that has already come up, body composition assessment in early infancy.  Dr. Ken Ellis, from the USDA/ARS Children's Nutrition Research Center, with Baylor College of Medicine.

    Again, Dr. Ellis, thank you very much for joining us.

    DR. ELLIS:  Thank you.  What I'm going to present today is a probably a little different from what most people have had experience seeing.  Some of this--in fact, all of this is going to be body composition beyond simple weight.  If one was interested in what the composition of weight is--let's see.  Which button do I...this is the laser, okay?  And which one of these is--can you hear me now?

    [Pause.]

    DR. ELLIS:  This is supposed to move the slides, she said.  I'm so used to pointing to the screen these days, so it's my fault.

    As you already heard, most people--or most of the work, at least in infants, has been with weights and heights, all kind of weights and heights charts.  But body composition, at least the first models that were attempted, the basic classic 2-compartment model is to measure--is divided in two compartments, fat and non-fat.  The direct measurement of body fat is really very difficult to do.  It's not an easy process to do that.  And so for many years what we did was we said that if we could measure precisely some parameter of the body that would represent the fat-free mass, then subtraction of the fat-free mass from the total weight would give us a measure of the fat mass.

    Part of the problem with that is that all the years in the  (?)  scale for the fat-free mass translate directly to the fat mass, and you'll see what I mean by that in a few minutes.

    Three classic methods that have been used for 50 years, or maybe even longer:  underwater weighing in adults--you can't do this in babies and infants.  Practitioners as well as parents tend to object to holding babies underwater.  Hydration, this is probably the more common thing you'll find in the literature that is done.  It's the dilution technique.  You give a tracer, collect the blood sample or some fluid sample several hours after that, then do some manipulations on that,  (?)  space to get water, and make some assumption about how much water there is in the fat-free mass.  And as we all know, hydration content of the fat-free mass in children at very early ages changes dramatically.  So depending upon what you assume, you can then quickly be off in your estimate of the fat.

    Whole-body counting is another method used to measure potassium content, primarily the body cell mass.  And, again, how much that relates to the--how much of that is a constant fraction or not of fat-free mass at these ages has also been questioned.

    So, again, like I said, the difficult underwater weighings, difficult to do the infants.  For the water measurements, they must swallow all the tracer, collect some kind of fluid sample.  Plasma is your best choice.

    The problem with this is you can't repeat it.  If you want to do the trial a week or so later, you can't.  You need to leave a sufficient amount of time for the previous tracer to clear, or you start increasing the doses of the tracer to compensate for that.  And, again, in the past, the most accurate assays required one to have a mass spec or availability of a mass spec, which is not in everyone's laboratory or garage, as I usually say.

    Whole-body counters, the problem with those have been over the years really they haven't been designed for infants.  They're really designed for adults.  There are a few of us who have done this, but in general they simply don't exist.  And even if you do have a whole-body counter available to you, most of the time it's not in a clinical setting.  You're going somewhere else to get the measurement done.  Again, the reason that tends to exclude it, at least for infants and children.  But one nice feature is you can repeat this as often as you wish, and so you could do it on a daily basis if one chose to do that.  So if you have access to a counter, one can count these infants as frequently as one chooses.

    And because of the way in which the counting procedure works, there doesn't have to be any really significant constraints.  They can move around and be--it will not really affect the results that much.

    Now, I looked at the various things that we received in reference to this meeting, and there's the document in there from the American Academy of Pediatrics where there's one little paragraph on body composition.  This was in June of '88.  And it says, "Normal growth implies appropriate composition of the increment in body weight.  Sequential measurement of various aspects of body composition"--such as water, fat and bone--"have the potential for defining changes in body composition."  However, at that time the opinion of the Task Force was that such measurements have not yet reached the stage of precision, non-invasiveness enough, and they're not that very convenient, and which I've just showed you that sort of in the three previous procedures, which I would agree in 1988, I would agree with that statement.

    That gives us a quick summary of that.  It just says you want to be able to--that the weight is appropriate composition, you will be able to do longitudinal measurements.  Again, for clinical testing, the precision, noninvasiveness, and convenience are the issues that they put out.

    Again, the 2-compartment model, I quickly talked about the limitations.  The density is not constant.  The hydration is not a constant.  Extracellular and water ratio is not constant.  Bone accretion is not constant.  Basically babies aren't constant.  We know that, right?  Because we know that when the baby comes back a month later, it's not the same baby we saw a month before.  It's a whole different child totally, at least from a body composition point of view, unlike adults, which really change very slowly over time.

    Now, so what's happened in pediatric body composition research since 1988?  There are sort of three general areas where there have been advances made.  One is bioelectrical techniques.  In pediatrics, I want to take a broad sense of pediatrics.  That's anybody under the age of 18.  And there's been quite a bit of work used with this technique in older children.  When it has been tried or used in infants, it has not been very successful at all.  In fact, in general, most conclusions with this technique--I should say the first two techniques, the bioelectrical impedance and the bioelectrical spectroscopy, which is the same as this but at two frequencies, have been that the information gained from those two techniques really hasn't been much more than you already knew when you had just simply weights and heights with these individuals.

    TOBEC, on the other hand, has been more successful, but the problem with TOBEC is that these machines are very--they're not common, there are not many out there.  There's probably not more than a dozen--a half a dozen, in fact, for infants.  And so, again, this is a technique that holds promise or has held promise, but, again, it's not a technique that is widely available at all.

    Absorptiometric techniques, DXA.  Remember that Academy of Science report?  It was in 1988.  Well, in 1993, the world of X-ray absorptiometry changed because at that time it went to what's called DXA.  It went to X-ray sources, different detectors.  The whole technology advanced substantially such that one could now consider this technology for whole-body measurements in infants, and one can do a localized region, such as the spine, if that's specifically what one would want to do.

    Another area, body volume measurements.  Remember, underwater weighing doesn't work.  This is an air displacement plethysmograph technique now, which is just actually started this year, so maybe in another few years, when this committee reconvenes, we'll be able to talk more about this.

    I'm going to basically focus on the absorptiometry or DXA or DEXA methodology because that right now holds the best promise for this kind of broad application.

    So the basic model, again, in 1988--I mean, there are more models, but the very basic model, again, we're talking about fat and fat-free mass.  And today, when one talks about body composition, one is almost forced pretty much to really address this kind of a model over here, 4-compartment model:  fat, bone mineral, mineral composition, ash, the water, and protein content.  And, in fact, it is these compartments that we're interested in when we look at body composition at any age and look at change in body composition at any age.

    So that 4-compartment model is now shown on your left, and what does DXA provide us?  DXA provides us--the only method we have that provides us a 3-compartment model for a single measurement.  We have a measurement of fat, which is directly the fat.  We get a measurement of BMC, which is bone mineral content, which is for the mineral compartment.  Eighty percent of this is this; there's another 20 percent which is distributed in the non-mineral, non-osseous compartments.  Then a third large compartment called the lean tissue mass, which is the non-bone, non-fat compartments.

    So when we do a DXA measurement of anyone at any age, we get this basic model, which has been a huge advance.

    This is what an image looks like in a child.  Most of the time people show images of adults.  Adults are very nice.  They hold their arms right.  They put their legs down straight.  Children, amazingly, 14 years of age, haven't learned how to do that yet, have they?  So they tend to lay the way they want to.  We do make measurements in children at our place.  We've done probably, I don't know, 600, 700 DXAs, at least.  Maybe a thousand.  I really don't know the number these days.  Quite a few studies.

    Let me show you, again, if one is interested in just the spine, this is not an infant spine.  It happens to be an adult image there.  But one can localize and make a measurement just at the spine for bone.  But for body composition work, we do total body measurements, which is shown here.  I think you can get an idea of the skeleton seen.  I think you can see the soft tissue parts that are obviously not the bone, and we can then get that information.

    Now, as you heard earlier, the BMI in terms of the height of heads, if one chooses to, one can actually decapitate the image and just worry about this part if you're concerned about how does the head contribute to all this information.  So it is possible to do that kind of stuff.

    By the way, the time it takes us today--in 1988, if you attempted an infant, which you would never be successful at, it would be 25 minutes.  Today we do an infant in less than 3 minutes with the newer scanners.

    So DXA, what are the advantages and some of the disadvantages?  First, DXA has almost achieved a reference status within the body composition field.  It still has some improvements to be made, but, again, in terms of everything else, it is the better technique that we have.

    Advantages, it does give us good precision and accuracy.  It is the only technique for a single assay that gives us basically a 3-compartment model:  bone, fat, and lean.  As you can see from that image, we can get some regional information if we choose to do that.  It has a very low exposure risk.  There's a very minimal amount of that.  And there are more the reference populations out there for adults, for children, and they are being developed, and several for infants, if you know the references to look for.

    The disadvantages, very low exposure risk, the same thing.  One could argue whether it's an advantage or disadvantage.  One of the ways I talk about this--in fact, I just thought about it coming over here yesterday on the plane--was on the flight I was on, there was at least five children under the age of 2 on that flight with me, and the radiation dose they got on that flight exceeds what you get from the DXA.  In fact, it's two to three times higher.  So there's an idea what the risks are involved.

    Scanners are not optimized for infants.  If you get a scanner, it's adult size.  They have not--the industry has tended to resist this, primarily because of the market that they are focused at, which is osteoporosis in older women.

    It doesn't give us a 3-D image.  You saw that 2-dimensional image.  It gives us 2-D not 3-D imaging.  That's what I call boot-strapped 3-C model.  It's not a perfect 3-C model, but it's not bad.  And the one different problem has been that the results differ between manufacturers, so that if you do a study--a multi-site center study, you want to stay with the same instrument, same software.

    Okay.  Precisions and accuracy of different body composition measurements and the minimal detectable change in an infant.  For this I chose basically a full-term infant and made the assumption it's 15 percent body fat.

    If you look at these methods, the water, the dilution method, or the bioelectrical impedance method or TOBEC, this is the potassium one here.  DXA, the bottom three, the fat, fat-free, and the bone, precisions, these are optimistic.  Precisions tend always to be a little bit better than--whenever you do a precision measurement study, they always do much better than they do random.  I'll guarantee that.

    This shows you the precision measurements here.  This is generally the accuracy.  Precisions are, let's say, in the 1 to 5 percent range or 2 to 5 percent range.  Accuracies tend to be in a 3 to 5 percent range.

    If you take these, this information, and take this size of an infant and you translate those into one of the minimum detectable changes for that infant, the values are shown here, the last column on the right.  And the percents are those percents of what that person had in terms of the composition at that age.

    So one can measure water, changes at 5 will start to show up.  If you use (?)-ium, they can get worse, TOBEC or BIAs.  Fat-free mass, 125 gram changes, only about 5 percent of the total fat-free mass.  Forty grams of fat, if that starts to change, it's 8 percent of that to implement this weight and composition.  You can start to see changes relatively quick with the single input.

    This shows the relationship between precision of the methods and what kind of a change, minimal change is required for that to become statistically significant, at the 5 percent level and a power of 0.8.

    I call this the clinical application in individuals, what I consider to be a clinical application, what's changed in that individual.  And if you look, for example, this is the relationship that the minimal detectable change--this is approximately 3 times the precision.  And so that if our precisions for BMC are somewhere around 2.5 percent, 6 percent change would occur, lean tissues at about 10 percent, and fat-free mass at around--if the precision is 4.5 to 5 percent, we'd have to see changes in the range of about 14 percent to be significant for that individual.

    Now, this I show you because this is the difference between 1988 and today.  I only got 30 seconds left?  Oh, well.  Well, okay.  Very quickly, this shows you the methods.  This shows you what the precisions are for FFM, fat-free mass.  If you translate those into fat-free mass, this is the tail here.  The top three were in 1988.  That's why you couldn't do it.  DXA is hugely improved since then.  This shows you the kind of weight gains that would have to occur in infants at, I guess, again, the standard term infant here, very small weight change with DXA, large weight changes for--I'm out of time.  She says zero.  Anyway, the--I'll keep on going anyway.  And the number of weeks would have to be changed in that individual.

    I can't believe I've taken up all the time already.

    This just shows you the rates of change that occur with age.  You can find this from several different sources.

    This shows you the first six months where you have to figure out what the rates of change are in the composition, and here's a series of papers that are on infants.  There's one on TOBEC.  This one used a series of methods which we can do.  All the rest you'll notice, with the exception of one here and dilution, were done with DXA.  The weight ranges are shown here.  The number of infants are shown over here.  We are now doing a meta-analysis to bring this together into one common reference database.

    And I want to just quickly go through these, again, precisions, 2 percent, 3 percent, 6 percent reported here.  And if you did a calculation--this is an interesting study in twins.  They looked at the weight difference, which is about 14 percent, and they calculated again with these alpha-5, power of 0.8, 40 to 45 infants would be needed to detect a 15 percent difference in one or more of any of these three compartments.

    Another paper here, let's see, this particular paper does give percentile curves for each of these values as a function of weight.  And, again, what I'll point out here is that even though you may have a 3.5 kilogram weight infant, the fat range can range from 10 to 26 percent.  By the time they're 10.5 kilograms of weight, it can range from 22 to 23 percent.  So weight does not represent fat.

    This paper is another one that has percentile curves of each of these compartments versus weight.  Again, comparable to the other one, actually these are lower fats.  Interesting, this is an European study.  The previous one was a U.S. study.

    And I want to show you--this is the last one, slide here, and I'm a minus five I think now.  But here this shows the changes--total body DXA.  These were preterm infants.  Initial weights were about 17--under 1750.  This was fortified human milk formula, and this was a preterm infant formula.  And the baselines are measured at 3 weeks of age, repeated again at approximately 3 to 4 weeks later.  And this shows the statistical ability to measure changes in body weight, for example, and in the fortified human milk you can make--you can change--see the difference at this level.  And the preterm formula, it's about 19.9 grams per kilogram per day.  Differences were four, and this is statistically different.  You can see these--in other words, you can compare in groups of 20 versus 30, you can see differences in weight, you can see differences in lean mass and fat mass and bone mass by DXA at three weeks between these two groups of children.  So it is possible with DXA to measure not only weight but the composition of that weight change in relatively small sample sizes.

    And if you want to convert those to growth kind of numbers, this represents about 2.3 grams per kilogram per day in terms of growth, and that's about 12 percent of the mean weight gain in terms of composition.  This again is 2.1 grams of lean mass per kilogram per day, 15 percent, about 1.2 grams per kilogram for fat mass and 76 milligrams per kilogram per day for the BMC.

    So the point here is that we can measure--I think there was a question, could we measure changes of composition that would be comparable to 3 grams per day?  The answer is--from this study the answer would be yes, we could do that.  Again, relatively small sample sizes of 25 to 30 children.

    I'll end there.  Thank you.

    DR. GARZA:  Thank you very much.

    Any questions or comments?

    DR. MOYER-MILEUR:  I have a question.  When you do your measurements in your babies, are they sedated?  Because we find that we require sometimes more than two technicians to keep a baby quiet to minimize the movement artifact.

    And my other question is, with preterm babies, we found it somewhat difficult to do early measurements because of equipment artifact, that they have leads and monitors on that make it very difficult to get a true assessment using DXA.

    DR. ELLIS:  Yes, two things.  One, none of the infants that we measured and none of the infants that any of these studied were sedated.  These are--again, they're all healthy children.

    Our experience has been if you feed them right before you want to do the measurements, they tend to be rocked in the chair by the mother or someone, they go out, and then you can make the measurement pretty easily.

    I don't know which machine you were using.  Was it a 4500A or 2000 or--

    DR. MOYER-MILEUR:  We have a 4500A, and we also have--

    DR. ELLIS:  You should do it in three minutes or less.  You do have to work at it.  I mean, normally with older children who will cooperate, that could be anywhere from age 5 to 18, depending upon what you're looking for.  But those children can get on a bed and will cooperate and can be--and they'll do it.

    Here you can do the whole procedure in ten minutes or less.  Here sometimes you have to spend as much as an hour to get the one measurement done.  You have to work at it.  But none of these children were sedated.  We don't sedate any children in any of our studies.

    DR. MOYER-MILEUR:  Yes, and I just, you know, would caution DXA in the infants in that it requires people with specialized training so that you can't just--

    DR. ELLIS:  Yes, yes.

    DR. MOYER-MILEUR:  --go to a community hospital and get their--

    DR. ELLIS:  Yes, you can't--you cannot send these children to a radiology department even with the hospital because they simply are not experienced with measuring children.  They just don't like it when children show up.  They don't have--infants, they basically will send them back.  They will not--they will not take that hour, hour and a half to do it in.  It takes effort sometimes.  Sometimes they go right on the bed and out.  It's always the ones that show up at 4:30 that take the hour and a half to two hours, though.

    As far as the artifacts, you're right.  You have to be careful about artifacts.  You can delete those off the images, though, pretty well.  If you take leads out, for example, out to the side, you can delete those right off the images.  And so that's a minor effect if you deal with it right.

    DR. GARZA:  Dr. Denne?

    DR. DENNE:  I was wondering if there are any direct comparisons in infants between skinfold thicknesses and DXA for fat mass.

    DR. ELLIS:  There may be a few, but, again, the issues have been that skinfolds are probably more difficult to get than the DXA.  We have skinfolds in some of our kids, but we just don't rely upon them for anything.

    DR. DENNE:  It would be an interesting comparison to make.  You know, relative difficulty depends on what you're actually used to doing.

    The other question is:  How is DXA validated in infants?  I mean, most of this body composition, you know, was validated against the other techniques which all have their own sets of issues.

    DR. ELLIS:  Yes.  The validation of DXA are done two ways.  One is with animals, small animals have been done.  We have done 73 piglets under the weight of 10 kilograms.  Other people have done comparable size piglets.  And probably if you add everything up, it's probably about 200 pigs have been done over the years at different centers with different machines.

    The pig is not the best of models because, for example, its bone is more mineralized than infant's.  Weight-wise, composition-wise, soft tissue is not that bad.

    The other way we've done it is we actually built phantoms.  We've actually fabricated mock-ups of the human body with parts made from polyester resin, doped with calcium and phosphate--phosphorous compounds to simulate that.  But that's how it's done.

    I have also done cadaver work.  The problem--not the problem.  The situation is that, unlike Elsie Widdenson, today's environment would not allow one to chemically digest the infant body, so we have done that in about 30--more or less 30--these are all preterm infants, and we did that by a technique called neutron activation analysis where we do a nuclear, chemical--nuclear chemistry technique where you measure calcium, phosphorous, sodium, chlorine, phosphorous, manganese and magnesium and potassium.

    And so if I look at the BMC bone versus fat, and if I look at the other ones and make some model--I have to make some modeling assumptions now about how much sodium is in the water, extracellular water and so forth, but they come out pretty well, with the 5 percent kind of accuracies.

    DR. THUREEN:  In the past several years, it's been recommended that at different centers, even if you have the same type of machine, you should do your own phantoms.  Do you think there's enough phantom data out there now that that doesn't need to be done?  Or if you're going to do a multi-center study, do you think that needs to be done?

    DR. ELLIS:  Well, for the multi-center study, there should be at least a common phantom that is going around to all those sites.  One, to do the initial calibration to be certain everybody is within reason of the numbers, and then continue on throughout going for the study.  That's typically what we do in all studies at all ages, whether it's infants, children, or adults.  That's what we do these days.  For multi-site studies, there's a common set of phantoms that go around all the time.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  I'm one of those other six people in the world that's got the TOBEC, so I agree those, you know, have tremendous advantages.  But I don't think that we would be able to use them.  So I think, you know, bringing us to issue with DXA and how we could use it is an important question.

    Laurie asked one of the big questions that I'm always asked, which is about sedation, and I would agree that, you know, natural sleep and that sort of thing and working in the research setting.

    The other question sometimes is:  How many images do you really have to take to get the one right?  I even noticed on your slide the hand is--

    DR. ELLIS:  The hand was a little off.

    DR. STALLINGS:  And I just spent last week working with DXA and trying to figure out which one had all the body parts there and minimal movement.

    But would you share with the group, you know, how frequently do you need to do two scans or you get halfway through a scan and then you do it again to get a good research quality measurement?

    DR. ELLIS:  I would say it's definitely less than 10 percent that we have to repeat the scans.  It's like you say, there's a technician there.  The image is being acquired while the scan is being done, and you can stop it immediately to start again, as you well know.

    We have some--we have looked at some scans where the infant has moved, but we finish the scan and then repeat the scan again and looked at those.  It has a lot to do with what kind of movement you have.  As you well know, you can--if the child's arm is here--or say here when it starts and here when it ends, you have a three-armed child in the image you end up with, because it was here the first time you scanned through and caught on the second, on the lower case.

    We have found that if we have motion in this direction, there tends to be a minimal effect because you're not changing anything.  You're just moving the slice over a little bit here.  But it's when there's movement like this, a flapping of the arms or kicking of the legs, if they're doing that, we don't scan them.  We stop.  But that's usually what happens, they wake up or something.  Less than 10 percent.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  I have two, Ken.  How well described are the specs of equipment that one would need to be able to measure infants reliably?  Is there pretty much a consensus on the quality of the equipment, the DXA equipment that would be needed?

    DR. ELLIS:  You'd have to have something that's equivalent to what's called--there's basically two manufacturers in this country.  One is Lunar, the other one's Hologic.  You have to have at least the DPXL for the Lunar at least the 4500A or DelphiA for the Hologic.

    We always use Hologic's, and, in fact, we, you know, are constantly trying to improve those machines.  I'm not going to tell you they're perfect, but they're the best thing we have.  I think they could make them better.

    DR. GARZA:  You also indicated that individuals or personnel had to be specialized or had to be highly trained.  How much training do individuals need to be able to use this equipment reliably, or was the training in reference to just training and dealing with pediatric populations?

    DR. ELLIS:  It's more dealing with the pediatric populations.  It's more of that than it is simply for this, because once they understand--basically what you want to do is you want to minimize motion and have them in the right position and things like this.  But it's more dealing with the pediatric population, dealing with a child that may want to cry for 20 minutes or something, or a mother that could be apprehensive when she hears the child crying.  It's more that issue than it is anything else.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  I just want to ask a little bit what Bert was doing.  If we were doing such a study, a multi-center study, what would your advice be about centralized reading of the scans, the technician at the instrument site?

    DR. ELLIS:  It is a good point.  These days, again, it is common practice now to send all the scans to a common central reading site because at least what happens there--well, I'm thinking more of the adults.  If there is any kind of bias--in adults you set regions of interest.  In the infants, it's a total body scan.  There's no region of interest set.  So it's less of an issue there.

    But, again, the judgment about good scans or bad scans would come from one source and not from different sources.  So it would be a reasonable thing to do.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  Thank you very much, Ken.

    DR. ELLIS:  Sure.

    DR. GARZA:  Committee members and guest speakers are invited next door for coffee.  We're going to break right now, instead of at 10:35 as on your schedule, so that we don't break up the following three presentations.  But I will ask everyone to try to get back here at about 10:25, 15 minutes from now, so we can assure that we don't eat into any discussion time with either the speakers or anyone else.

    Everyone else that is not a speaker or on the committee is invited to the cafeteria.  These are federal rules.  I didn't make them.

    [Laughter.]

    [Recess.]

    DR. GARZA:  The committee is seated at the table.  If I can have our guests please take your seats, we're ready to start.

    Our next speaker is Dr. Frongillo.  Dr. Frongillo is an associate professor in the Division of Nutritional Sciences at Cornell University, and he's going to give us an overview of the World Health Organization Growth Reference Study that was referred to a bit earlier by Dr. Chumlea.

    DR. FRONGILLO:  Good morning.  I'm going to stand here in the middle and use this archaic technology.  It might help if we could dim the lights up at the front here a little bit.

    These are some growth data from a single child, and you can tell what country they're from if you look at the units of measurement in pounds.  And if we plot the data, it's actually more interesting to look at.  We see a trend like we expect.  But to try to really discern anything about what the pattern is, it's helpful to compare it to something.  And so this is a graph--this is the old U.S. reference, the 1978 reference, and this is the same child.  And you can see this graph on the left, the child started off at the bottom of the distribution and then seemed to climb into the chart a bit, and then at about four months or so started to really--three to four months in there, started to really--its trajectory is now falling well below the chart.  And eventually by--this is about two years where you see it comes back onto about the same percentile where it originally started.

    This discrepancy that we see is either telling us something about this particular child or it's telling us something about the reference.

    Well, it turns out it's telling us something about the reference because that's a breast-fed child.  And if we look at a data set, this is a comparison that the WHO infant growth--an analysis that was done in the early 1990s, and what it shows is that, if you look on the left here, this is for boys, this is weight in kilograms and age going up to 12 months.  And the dotted line to the 1978 U.S. reference which was adopted about that time, just after that by WHO, is the international reference.  And the breast-fed data set, these are infants who were exclusively breast-fed for four months and then continued breast feeding through the first year.

    And you can see again that about four months you start to see the solid curves deviating from the dotted lines, which is the same pattern that we just saw.  And so this is showing that in a sample of about 426, I think it was, infants epidemiologically what we just saw in that individual child.

    And a similar pattern was seen for girls.  We can amplify this in a way by--what I've done is just simply take the current reference, the current international reference, the 1978 reference, and that's what would be at zero.  So if these children were growing exactly like the reference, there'd be a horizontal line right at zero here.  But what we can see is that this breast-fed set seemed to grow, if anything, a little bit faster at first, and then by the end of the first year had increased their weight substantially less than the U.S. set.

    In fact, if we calculated the difference in rates for these two groups from zero to 12 months, it's about 2.7 grams per day.  So this is something to keep in mind for later when we're thinking about how big our meaningful difference is.

    The difference from about one month, which is at the top there, from the maximum to the minimum at 11 months is about five grams per day.

    So that was for weight.  If we calculate a z-score--and we'll have--I'll put this up here because we're going to hear about z-scores at various points.  A z-score is where we take a particular measurement for a child and compare it to a reference median, whatever the reference is, and then divide it by a reference standard deviation.  So the graph I just showed you just showed the numerator there, but for a z-score we also divide by the standard deviation.  And the reason we do that is because then it's easy to imagine that the growth of a, quote, normal population would fall between about minus two and plus two z-scores.  About 95 percent of the distribution would fall there.

    So if we look at z-scores for this breast-fed set, then what we saw was that, regardless of the index that was used, whether it was length-for-age, which are the triangles, which is the curve near the bottom, whether it was weight-for-age, which is the circles, or weight-for-length, which is the squares, we saw a very similar pattern with this breast-fed set in comparison to what was then the U.S. and international reference.

    So this discrepancy, along with other information that was obtained during the review that the Infant Subcommittee made during the early 1990s leading up to the WHO publication in 1995 of the uses and interpretation of anthropometry, the recommendation was made that consideration should be given to making a new international growth reference.

    The justification for having an international reference is, first of all, that it allows cross-national comparisons to be made that otherwise couldn't be made; and since there's been an international reference since the late 1970s, it's allowed us to do some things in a comparative way globally that we weren't able to do before.  For example, this is a graph from a WHO publication that was in the bulletin of the WHO showing that the trends that have occurred from 1980 to about now so that we could actually look at the progress in Africa on the left, the very rapid progress that occurred in Asia.  This is in percentage--the percent of the population that's stunted--and the rapid progress that was made in Latin America and the Caribbean region.

    This kind of comparison has been made possible because there is a common reference being used throughout the world.

    In addition, we know that it's very expensive to make local references, and also that in developing countries where there's still a very strong cyclical trend in growth, if a local reference were made, it would have to be revised very quickly because of changes that are occurring.

    The justification for having an international reference goes back to work that was done in the early 1970s.  This is a well-known graph from a well-known paper, in '74 I think it was, showing that these curves right here were all curves of high SES children in well-off countries, whereas the ones that were down here were children in developing countries that were not so well off.  And so the fact that these were all so close together meant that growth roughly from one place to another where children are growing in conditions that are favorable to growth tends to be roughly about the same.

    Some work that the WHO has done with a cross-national data set collected by the Human Reproductive Program recently shows that--this is for girls--across a number of different countries, these are children who were reasonably well off SES, not necessarily the highest SES, showed basically that, with the exception of this lower curve, which is in China, that these other curves all pretty much are very close together.  Again, giving more recent evidence of the idea that it was reasonable to make an international reference where data from multiple countries could be combined.

    So let me tell you, then, a little about the effort that's underway in the Multi-center Growth Reference Study.

    First of all, I wanted to point out that a reference--the idea of a reference is that it's a tool for providing a common basis for the purposes of comparison.  So we're interested in references because it allows us to compare as opposed to a standard which then involves a judgment.  So here we're talking about making a reference, and during the early 1990s, both the U.S., in preparation for the revision of the U.S. reference, and also WHO examined the current reference which was being used in both the U.S. and internationally.  And the sample that had been used for the early infancy especially was from Fels, which was one particular place in Ohio.  The measurements were taken every three months, and in the very early period we might wish for more than that.

    There were very few infants that were breast-fed for an extended period of time, and at the time that this reference was made, there simply wasn't the technology to do curve-fitting that we now have.

    So those reasons, plus the main factor that the breast-fed infants seemed to grow differently than infants who were not necessarily breast-fed according to feeding recommendations, drove the decision to make a new reference.

    At the time the WHO feeding recommendation was that infants should be breast-fed exclusively from birth up to about four to six months, and then after that they should continue to be breast-fed for up to two years or beyond.  You may know that WHO recently--I guess about a year and a half ago--revised this to be from birth to about six months for exclusive breast feeding.

    So the objective of the Multi-center Growth Reference Study that WHO is doing is to build a set of growth curves for all children under age 5 years to be adopted as a new international reference for assessing the growth and nutritional status at both the population and individual level.

    When this effort was started, it was clear that there were a couple of conceptual issues that needed to be thought through.  One was that some references have been constructed, especially, for example, in the U.S. or in England, to take two examples, have been constructed to be descriptive references, meaning that they were intended to describe the growth of the population at a particular time.

    This is different than what's going on in the Growth Reference Study, the WHO Multi-center Growth Reference Study, which we can think of as perhaps a prescriptive reference, meaning that it's meant to be a reference that depicts the growth of infants who were fed according to current recommendations for how children should be cared for during infancy.

    The other issue had to do with maximal growth versus optimal growth.  In the past, we've had a tendency to think that maximal growth and optimal growth are the same thing.  The graphs I just showed you indicate that when infants are breast-fed, at least during the first year, and perhaps into part of the second year, they are not the maximum size they would be if they were breast-fed, but we think that because they're fed following feeding recommendations that that corresponds to optimal growth.

    So the design of the study involves multiple geographically diverse sites.  There's a longitudinal component which goes from zero to 24 months.  And each site was asked to recruit about 300 infants per site in the hopes that at least 70 would be available for inclusion in the final reference.  We've actually done better than that because the compliance with the feeding recommendations by the mothers and infants has been much higher than the 25 percent that we feared might be there.  So we've actually ended up with quite a bit more than 70 percent.  And then very frequent measurements, I'll show in a minute, and then there's a cross-sectional component which overlaps the longitudinal component.  It starts at 18 months, goes up to 71 months, past 5 years, to make sure we have enough data on the right-hand side to be able to characterize growth well up to at least 5 years.  And sites were asked to recruit about 1,400 per site, which, again, would give a minimum at each age of about 70.

    In the longitudinal component, which is a very demanding part of this study, measures of weight, length, and head circumference are collected frequently during the time.  At birth, there's one visit, of course, and then in months 1 to 2 they're biweekly.  So there's four visits there.  In months 3 to 12, measurements are monthly, so there's 10 visits for that.  And then in roughly the second year, they're bimonthly, which is six visits during that time.   And then arm circumference and skinfold measurements are also taken in the same schedule, starting at 3 months, as in the other measurements.

    Now, the way the study was constructed, there were a set of criteria at the population level and then a set of criteria at the individual level.  So at the population level, the idea was to find populations of infants who did not have socioeconomic constraints on growth, where mobility would be low so that they could be followed, where at least 20 percent were willing and able to follow the WHO feedings recommendations, with support, so there had to be existence of or at least the ability to build breast-feeding support systems; and then there had to be local presence of collaborative institutions who were capable of carrying out this kind of exacting work.

    Then at the individual level, individual criteria were set that there was an absence of health, environmental, economic constraints on growth; the mother was willing to follow the WHO feeding recommendations; that the mother was a non-smoker; that gestational age would be at term, which we defined to be 37 to 42 weeks; and that the infants wouldn't have any severe illnesses that would be expected to affect growth.

    The protocol for the study site selection then applied to subpopulations the fact that socioeconomic status did not constrain growth, it was low altitude, low mobility, the minimum of the 20 percent, existence of breast-feeding support systems, the local institutions.  We looked at the rate of hospital deliveries because we had to know that there were enough infants being produced quickly enough that they could be enrolled in the study so we could get the study done sometime in our lifetime; that there would be sufficient numbers of eligible birth; and that it was feasible within those locations.

    In some places, for example, in a really huge city of 10 million, it's just not feasible to do a study like this.  The logistics are too difficult.

    The Steering Committee also considered some other factors in its thinking about in looking at mean birth weights, maternal heights, complementary feeding practices, health-related behaviors, and the existence of environmental hazards.  The Steering Committee looked at geographic distribution.  It's a global reference, so WHO tried very hard to have geographic representation throughout the world, and funding issues, because it's expensive to do this kind of study and we had to think about where the funding would come from and how that could be arranged.

    The protocol was developed by this set of characters here.  The main reason I put it up here is to show you that it's a multidisciplinary set of people representing a diversity of backgrounds and institutional relationships.  So this group put together the protocol roughly in the '95, '96 time frame.  And then the study is being run now with an advisory group:  Cameron Chumlea, Tim Cole, myself; Ray Martorell is the Chair of this group; John Van den Broeck, who recently moved to South Africa; senior scientists representing CDC and UNICEF, previously was Roger Shrimpton in UNICEF; and then WHO, the day-to-day work gets done at the sites and at WHO Secretariat.  Mercedes de Onez coordinates all of this with her staff there.  A very dedicated group of people.

    The sites that have been selected are the following:  in Pelotas, Brazil, in the south part of Brazil near the coast; Victoria is the PI with Cora Post; in Oslo, Norway; in the U.S. at Davis; in Muscat, Oman; in Accra, Ghana, the capital of Ghana; and in New Delhi, India.  These are the six sites.  Each of these sites represents very differing and very large challenges to carrying out this study.  And it couldn't be done without the commitment of the teams there who are doing the work on the ground.

    The Steering Committee is chaired by Cutberto Garza representing UNU.  Data management is done by local data entry and checking at the local, each site, and then the data are shipped to the WHO Human Reproductive Program.  They have extensive experience in handling large, multi-country data sets, and they've done a fantastic job in coordinating all of this and in helping to ensure data quality.

    The decisions and information about the study, there was a working group on the growth reference protocol.  We have Steering Committee and Advisory Committee meetings periodically, other meetings.  Particular tasks are handled by other meetings.  We do a lot through electronic mail.  Various site visits are made, were made before the study started in preparatory work and are made throughout the study, and rapid surveys were done at the beginning to get information that was needed to actually do the planning for the data collection in the particular sites.

    To give you an idea of what's involved, I just made a list here of sort of the documentation that's been produced, which will give you a feel for what was necessary to carry this off at the level of scientific quality that was being strived for.  The protocol was developed, a measurement of standardization protocol was developed.  A manual of operations, a generic manual of operations was produced, and each site had to adapt that for its particular location.

    There was a protocol for the 12-month visit.  A special effort was made throughout this study on the epidemiological quality of the study to follow all infants.  Even if mothers wanted to drop out or weren't complying, an effort was made to follow them as much as possible so we could keep measurements for every infant even if they weren't exactly following the feeding recommendations.

    In particular, we had an effort made that at 12 months we could go out and get at least some measurements on all those who were not willing to continue.

    There were guidelines for complementary feeding.  A protocol was developed for assessing diet, for the cross-sectional study, for data management, and then questionnaires were produced for both the longitudinal and the cross-sectional study.  And if any of you have been involved in questionnaire production, you can imagine what those meetings were like as people argued about the exact wording of every question and every answer.

    So that's an overview of the study.  Basically where we are now is that the last site--the sites have been selected over time.  Some were able to start earlier than others.  Some had technical challenges that took longer than others.  So in the next few months we'll be finished with data collection.  There's a meeting coming up to look at and try to decide on the final methodology for analysis.  Some preliminary work is underway.  So this will be proceeding, and we're talking about having a reference be available in the 2005 year.

    There's a lot of work that needs to be done in order to prepare for not just producing the reference but preparing for how it will be depicted and how it will be used and testing that will have to be done with the reference under the auspices of WHO.

    DR. GARZA:  We have about five minutes left for questions.  We might be able to go over.

    Dr. Stallings?

    DR. STALLINGS:  Well, one, to compliment the group.  This is an extraordinary effort and an extraordinary study.  But to cut to the chase, do you think that we will see this used in the U.S. as the growth standard for infancy through 5 years or even infancy through 3 years with what we think of now as our traditional infant chart?  So I'd be interested in your opinion and certainly in relation to what we're here for, which is to start to understand the best comparison group for children in the U.S. who are taking infant formula.

    DR. FRONGILLO:  Okay.  Well, I think I'll probably not try to answer your question so directly.  Let me just say that I think the advantages of this reference will be that it's longitudinal, that the longitudinal data in the first couple of years will have the ability to look at--to have a reference, a velocity reference, and so a judgment will have to be made whether that's better than, for example, the Iowa/Fels data that are available.

    The second thing is that we know that infants who are fed following the breast-feeding recommendations will show a different pattern of growth than formula-fed infants.  So to the extent to which it's seen as desirable to have a reference which fits that growth pattern for infants who are being breast-fed, then I think that would certainly be the advantage of the new reference.

    DR. GARZA:  It may be useful to describe the references that are going to be available.  Is it just weight and length, or are there others?

    DR. FRONGILLO:  Well, those reference data will be available for all of the measures that I showed, so we will have data on weight and length, head circumference, arm circumference, and skinfolds.

    DR. GARZA:  Any other questions?

    DR. THUREEN:  Why did you choose to include skinfold measurements, and what kind of information did you hope to get from that?

    DR. FRONGILLO:  Well, I think that there was a debate about, you know, the importance of this and when it should be started, and I think the lack of reference data on skinfolds was very compelling and it was thought that, given the potential usefulness of that information in the future, that while a study of this effort was--while this large effort was being made, it would be important to have that kind of information available.

    DR. GARZA:  Dr. Baker?

    DR. BAKER:  I have a question about the prescriptive nature of this.  If you're going to do a study like this using a prescription, it assumes the prescription is right.  It also assumes that it also would change, presumably, if the prescription changed.

    Now, the WHO has changed it since this study was done.  Does that make a difference?

    DR. FRONGILLO:  Well, certainly we thought about this a lot during the time in which the study was being planned.  I don't think anyone imagined that the basic recommendation about breast feeding and complementary feeding is going to change appreciably, and at least not for quite some time.

    Now, if it does, one of the things we wanted to do and part of the reason for the intensive follow-up even for infants who didn't exactly follow the current feeding recommendation was that it allows us to have the information available so that if 15 years from now we have new knowledge and decide that some slight revision of the feeding recommendation is made, anybody who's ever thought about changing the feeding recommendation will run after that possibility.

    But, anyway, if anybody gets brave enough to try to do that, we will have the information available from the cohorts in the six sites so that one could conceivably reconstruct the reference to conform with that.

    DR. GARZA:  Yes?

    DR. DOWNER:  Have you decided exactly which tool you will be using to do the skinfold measures?  And my second question is:  Because what we consider SES for different world populations differ so widely, how are you going to decide on what to use in this study?

    DR. FRONGILLO:  Okay.  The skinfolds, which tool, do you mean which skinfold caliper?  Basically I think--and Cameron can comment on this because he's the expert here.  But my experience is that it's not the caliper that makes much--any difference at all in the measurement.  It's the quality of the enumerator and their training in using the instrument.  So that's not going to make any difference.

    DR. DOWNER:  What instrument have you planned on using?

    DR. FRONGILLO:  The instrument we are using is--what is the instrument we're using?  Holtain, right.  Okay.

    And the second question?  I'm sorry.

    DR. DOWNER:  The SES.

    DR. FRONGILLO:  Oh, the SES.  In each site, a survey was done before the study began, the main study began, to actually look at the relationship between socioeconomic status and growth so that we could develop in each site exactly what the criteria needed to be from a socioeconomic standpoint in order to ensure that the population of infants selected was at high enough SES to not constrain growth.  So that was done separately in each of the sites, and some of us traveled around to different sites to help them actually carry that out.  And you're right, in each site different criteria were needed because the conditions were different.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  All right.  Thank you very much.

    We'll move on then to Dr. Larry Grummer-Strawn, who is the branch chief of the Maternal and Child Nutrition Branch at CDC, and he will tell us about the NCHS/CDC's growth charts.

    DR. GRUMMER-STRAWN:  Good morning.  If we can figure out how to forward this?  Which one?  Just here?  Okay.  Thank you.

    This morning I want to give kind of an overview of the new growth charts and contrast them to the old NCHS growth charts.  I'm going to start off with kind of a historical perspective.  I'm sure many of you know the history but to kind of just set a context for all of us, do some comparisons of differences and similarities between the old and the new, and then go into some of the differences a little bit more explicitly, and then finally end with some analytic issues that the charts pose for us.

    The original NCHS charts were released in 1977.  Those charts were only percentile curves.  They were published by Hamill, et al., really became the standard of reference for all U.S. infants.

    Subsequent to that, there was a normalization of those curves at CDC.  Those were actually published in 1987, but were actually available for use long before that.  So people who were interested in normalized curves had access to them earlier.  And, finally, the WHO adopted those curves as being the international reference, really referring to the normalized curves.  The adoption by WHO actually came prior to the publication of the normalized curves.

    The reason that I point this out is that those curves never became one and the same.  The percentile curves never matched with the normalized curves, and so someone who was using clinical charts that actually saw the graphs in front of them was not necessarily using the same cut-off points as someone who was using computer software or might be analyzing data sets.  They were very similar to one another.  They were analyzed off of the same data, but were slightly different from each other.  And then, finally, in May of 2000, CDC released a revision to these charts.

    Now, at the time that the original charts were created, NCHS was a separate agency, and so they were referred to as the NCHS Growth Reference.  When people talked about the normalized curves and put them in an international context, they might have referred to the NCHS/CDC/WHO because of the separate role each of those agencies played.

    Subsequent to 1977, NCHS was actually incorporated into CDC, and so the new charts are referred to as CDC charts.  That does not mean that NCHS was not an active player.  They actually were the progenitors that moved the new charts forward.  But it was in a larger context of CDC, and other parts of CDC were also involved.

    So what are some of the similarities?  First of all, both sets of charts are looking at the same indicators.  We have weight-for-age, length-for-age, weight-for-length and head circumference-for age.  Both sets are sex-specific.  In neither case did we have any separation according to the parental anthropometry, race, ethnicity, infant feeding mode, different things that might impact on the growth of the infants.

    Ed just described for us kind of the difference between the idea of a reference and a standard or a descriptive reference and a prescriptive reference.  These clearly are references, not standards.  The only kind of movement toward a standard is that very low birth weight infants, that is, less than 1,500 grams, were not included in the new charts, the CDC 2000 charts.

    These charts reflect attained size, not incremental growth, and in both cases, we have accessibility of percentile scores and z-scores.  In the new charts, the z-scores are one and the same.

    The differences between the old and the new are, first of all, that in the 2000 charts, the data for infants are now nationally representative.  These represent a broader spectrum of race and ethnicity across the United States, a broader spectrum of socioeconomic status, and there's an increased representation of breast feeding in the charts.  That doesn't mean that they are primarily breast-fed children, but there's a mixture of formula-fed and breast-fed children.

    The 2000 charts are based on a pooling of several data sets coming together, whereas the 1977 NCHS charts were all based on the Fels data set.

    There were some minor changes to the smoothing techniques, which I will mention briefly.  As I said, the z-scores now are one and the same as the percentiles.  There's a one-to-one match on those, so it doesn't matter whether you use computer software or you're using printed charts.  You're going to be looking at the exact same cutoffs.

    Another minor difference is that the length now extends down to 45 centimeters rather than 49 centimeters when we're looking at weight-for-length, and when ow have on the clinical charts the accessibility of 3rd and 97th centiles.  Of course, based on normalized curves, you can get any kind of centile that you're interested in, but the difference is that the clinical charts that are produced actually do extend out to the 3rd and 97th percentiles, and the smoothing-out rhythm is intentionally extended out to those centiles to make sure that we're incorporating the original data out that far.  And, finally, as I mentioned before, very low birth weight infants are excluded.

    So what are the data sources?  Well, in 1977, I'm sure you're all familiar with the Fels Research Institute study.  It was done in Yellow Springs, Ohio, primarily represented Caucasian, middle-class families.  And while not exclusively formula-fed, this group is considered to be almost all formula-fed infants.

    There was a longitudinal follow-up study.  The children were followed from birth, 1 month, 3 months, 6 months, and at three-month intervals after that, but we're focusing on the first 6 months today.  The data were collected between 1929 and 1975, and there were a total of 867 infants.

    Now, the data for the CDC 2000 curves, as I said, represents a number of different data sources.  What I've put up here is a graphic showing you how at different ages, different data sets come in.  I'm going to talk about kind of these different data sets at different points.

    The primary data source is the NHANES III data here when we're talking about children birth to 6 months of age.

    Now, this is represented by the long line here in light blue.  The NHANES III was the only data set for which we had nationally representative data prior to 6 months of age.  Starting at 6 months, the NHANES II data were also available, and starting at 12 months the NHANES I data were also available.

    Now, the reason that these are important for us today is that because we're smoothing these curves across age, the influence of those older data sets does come in at 6 months of age and at 12 months of age.  The smoothing is across all ages, and so the curves below 6 months are also affected by those other data.

    However, the NHANES III data started at 2 months of age, and among 2-month-olds, it was a fairly small sample size.  In order to extend these curves down to birth, we had to look to other data sets.  And in each case, whether we're talking about head circumference-for-age, length-for-age, weight-for-age, weight-for-length, in each case we have to turn to different data sources.

    I'm going to start in the middle here to talk about in the weight-for-age.  In this case, it was fairly straightforward to use a birth point coming from the national birth certificates.  We had all of the birth certificates during the years that the children in these NHANES surveys had been born and had the birth weights available on all of those.  So it's a huge sample size, a very precise point that is truly not only nationally representative but a census of all births in the United States.  And so in creating the curves, we were able to connect that particular point, actually anchored the curves to that point, and then smoothed it with the data starting at 2 months of age from the NHANES.

    However, we don't have national data on any of the other indicators, head circumference or length.  With regard to length, we did find that there were two states that routinely collect length data at birth in a representative fashion.  We analyzed the data in those states for their birth weights against the national birth weight distribution and found that they were quite representative--those states were Missouri and Wisconsin--and felt that because the birth weight distribution matched the national distribution, we could expect that they should represent the national birth length distribution, even though we do not have data on the national birth length.

    So when we looked at the weight-for-length curves, those are based on connecting the dots between the Missouri and Wisconsin data on weight-for-length versus the NHANES III data weight-for-length, and those curves were connected together.

    Initially, we intended to do the exact same thing with length-for-age, use only the data from Missouri and Wisconsin to connect these curves across age and with the NHANES III data.  We did that in our first pass but analyzed--as we were evaluating the curves against alternative data sets, we found that we were comparing against the Chicago data set, the WHO pooled data set that Ed just described for us, as well as some of CDC's surveillance data, and found a common pattern in all three of those that the curvature between birth and 6 months did not match what we found in external data sets and felt that this was partly an artifact of the fact that we only had about 35 infants from the NHANES III data that clearly were not matching the normal pattern of growth.  And so the curves were being pulled in the direction of those NHANES III data from a very small sample size.

    So what we opted to do was to choose an additional data set to add in here between just beyond birth--these were not birth points, but at the first visit to a clinic--up through 5 months of age from the CDC's Pediatric Nutrition Surveillance System.

    These are data on low-income infants.  However, we didn't use a representative sample of low-income infants.  Instead, what we did is we chose clinics that matched the national distribution in terms of their mean, standard deviation, and skewness at each age from birth--from 3 months of age through 11 months of age compared to the NHANES III data.  So we were pulling out clinics that the children in that clinic happened to look exactly like the national distribution and chose those clinics and assumed that they would also look like the national distribution would have looked between birth and 3 months, and then added those data to the curves here and connected using the Missouri and Wisconsin data, the CDC nutrition surveillance data, as well as the NHANES III, and continued the curves using that.

    Finally, for head circumference, we had no national data, and we returned once again to the Fels Institute data for the head circumference at birth point and, again, connected that with the NHANES III data.

    So you see the picture here is one of bringing together multiple data sets.  We had a number of comparisons to try and make sure that these were valid comparisons to make, but it certainly leaves us with a difficulty using multiple data.

    So what are some of the other differences?  First, with regard to the smoothing, the old curves were smoothed with cubic splines, with knots at birth, 6 months, and 18 months, for those of you who work on these statistical arenas.  What that meant for us was that there were six independent parameters that characterized growth between birth and 3 years of age.

    In the CDC 2000, a completely different approach to smoothing was applied.  Here we used fractional polynomials that had been used previously in other growth studies, primarily used in the Fels data as well as some Canadian data.  And so they had kind of been proven methodologies for assessing growth during the first three years of life.  However, there were a smaller number of parameters that described growth, really just three independent parameters--roughly three, because there were some other things that were done to the curves to get them to fit.

    Finally, on weight-for-length, there were no set models, and so we used a 5th degree polynomial to maximize the flexibility of the curves there.

    I mentioned before that in 1977 the standard deviations were estimated as a separate path and created a separate set of curves.  In that case, we had two different standard deviations.  There was a standard deviation above the median and a standard deviation below the median.  And so if you think about kind of plotting the standard deviation as it goes across the curve, it was a low standard deviation that instantaneously at the median rose to a higher level.

    We calculated these scores in the normal way, taking the measure minus the median divided by the standard deviation.

    In the CDC 2000, this kind of immediate change in the standard deviation at the median was thought to not be desirable, and so instead we had a more continuous change in the standard deviation.  In this case, we transformed the data with a Box-Cox transformation, a power transformation.  People understand how you take the log transformation of a data set or a square root transformation.  The Box-Cox is a continuous set of transformations that you can then choose a parameter to say how much you want to transform that data to make it symmetrical.  And then once it is symmetrical, you can fit parameters to normalize that curve.

    Some other differences, clearly this group would be interested in the differences between the infant feeding in the groups.  As I mentioned before, the old charts were virtually all formula-fed infants.  The quality of the infant formula across that time, of course, has changed, so it is a mixture of a number of different kinds of feeding across those year '29 through '75.

    In the CDC 2000, we have a mixture of breast feeding and formula feeding, but it still is primarily a formula-fed group.  At 2 months of age, about half were formula-fed, half breast-fed, but by 6 months of age that was down to 28 percent currently being breast-fed.  And we're not talking about exclusive breast feeding.  As you can see, the exclusive breast-feeding rates are much lower than that.  Down to less than 10 percent by 6 months of age were exclusively breast-fed.

    If I can just take a couple of minutes to show you a comparison between the old and the new curves with regard to breast feeding, and we'll stop there, Ed showed a number of things as to how the old curves performed against WHO pooled breast-feeding data set.  We did some additional comparisons seeing whether the new curves have actually improved that situation.  So we've compared the WHO pooled data set that came together from six different studies of exclusively breast-fed children and pooled those data together.  And instead of plotting the means, which is what Ed was showing us, here I'm going to show the percent below the 10th percentile using the old curves versus the new curves.

    When we're looking at weight-for-age, you can see that there really has not been much of a change between the old curves and the new curves.  We have this same problem that as children get older, we're going to diagnose more of them as being underweight.  Just as Ed was showing us that the means go down, the percent that would be low is going to get higher.  And we still have that problem with the new curves, that it is considerably--we would have considerably more older children being considered underweight in this breast-fed data set compared to the younger children.

    However, with regard to height for age, we've improved the situation somewhat.  With the newer curves, there still is a tendency toward increasing the percentage that would be considered low as we get to older infants, but it's not as steep a trend as we had with the older curves.

    And, finally, with regard to weight-for-height, whereas the old curves showed that same pattern of worsening nutritional status over the first year of life, on weight-for-height the new curves have pretty much wiped out that problem.  We see a much flatter distribution across age.

    I'm going to skip over these because Ed is going to come back to many of these points in his description of the analytic issues.

    Conclusion:  The interpretation of the new charts is really not widely different from the old charts.  We're still using the same kind of way of thinking about growth and the way we analyze growth as very similar in the old curves to the new curves.

    There are a number of enhancements that argue for changing over to the new curves.  I'm not arguing that we haven't made enough of a difference to adopt these new curves.  However, I do think that the WHO reference that Ed has described would relate to a more substantive change in our interpretation of growth parameters, and we really need to give much more thought to different ways of thinking about growth than we have so far.

    Thank you.

    DR. GARZA:  Thank you very much, Larry.

    Any questions?  Dr. Stallings?

    DR. STALLINGS:  I actually have a series of questions, lots of things jotted down.  One, also, thank you and your team for doing this.  It was a monumental change to give us this from the clinical point of view.  It's wonderful to have the charts revised.

    In that last set of slides where you were showing the less than 10th percentile, just so I'm thinking about it correctly, the perfect outcome would be 10 percent would be less than the 10th percentile, because if we were looking at a population study, would you by definition expect 10 percent to be less than the 10th percentile?

    DR. GRUMMER-STRAWN:  Yes.  What I'm more concerned about is the pattern of growth there than the actual level.  We're comparing curves that are based on formula-fed infants mixed with some breast-fed infants against a group of breast-fed infants.  And so I wouldn't have been surprised if the level was somewhat different in a group of breast-fed children that might have less malnutrition than a general U.S. population.  But I would expect the pattern should be representative of the pattern of growth.

    DR. STALLINGS:  But, still, when I was looking at it, when there was almost zero less than the 10th percentile, I saw that as unusual as when we have 20 percent less than the 10th percentile, that that's sort of the concept.  If we were getting--whatever right is, but if we were getting it right, the population would go along that.

    While we have all the experts in the room, we've heard three different people say three different things about whether the Fels data included a lot of breast feeding or not.  And before you guys leave today or tomorrow, I'd like that to be readdressed.  You know, you were very helpful in showing in your slide the percentage of children who were breast-fed and then exclusive, and so obviously it's not one number.  Where you are at 2 weeks of age is different from 6 months.  But I think it would be helpful for the committee and for the FDA to have an understanding of what we believe the Fels data represented as exposure to breast feeding and what you believe your 2000 charts represent.  This is--

    DR. GRUMMER-STRAWN:  I'd like to have that, too, because I certainly--

    DR. STALLINGS:  Okay.  Well, then, you guys aren't leaving until we get this right.

    [Laughter.]

    DR. STALLINGS:  The other part of that is I think getting a handle on that will be even more important as the WHO new charts come out and those data are there.

    My last question, which really is a question, is:  If I understand, then breast-fed babies grow faster during the first 4 months of life compared to our usual reference data and more slowly between 4 and 6 months, if we were to look at the zero to 6-month period that we're really supposed to be focusing on.

    DR. GRUMMER-STRAWN:  I think it's more like around 3.  Is that about the peak?  Three months of age.  So faster in the first 3 months of life and slower after.

    DR. STALLINGS:  So that's part of the pattern that we're trying to capture that has made individuals difficult to look at.

    Thank you.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  I have two.  In thinking through the reasons why the WHO took more frequent weight measurements, one of the principal things that drove that was that very often the pattern of growth was used to assess the nutritional management of infants, the first 3 to 6 months.

    To what degree can the present reference between used with that amount of detail, the first 3 to 6 months?  Were the smoothing techniques in your judgment sufficient to capture the differences in growth patterns that Ginanne just described?  Or did the smoothing eliminate much of that?

    DR. GRUMMER-STRAWN:  First of all, you have the whole difficulty of dealing with the cross-sectional data as opposed to longitudinal data.

    DR. GARZA:  I know.  That was Part B to 1.

    [Laughter.]

    DR. GRUMMER-STRAWN:  So all of those issues come in there.  You have not a very large sample size in that age range, and so there is a fair amount of noise.

    In addition to that, you're fitting basically a three-parameter model to the first three years of life.  That doesn't give you a whole lot of degrees of freedom to really let that first few months take on a particular shape.

    That said, we did examine what the curves were doing in that age range, and they looked like they fit fairly well.  But there's a large amount of noise in the cross-sectional data that bounces from month to month, and you look at the curves and say, you know, I think we did the best we can given the data that we have.

    So I don't want to blame the three-parameter model, but I would say that if one had better data and wanted to capture really what is the pattern of growth in the first year of life, I probably would not do the smoothing in the way that it was done.

    DR. GARZA:  And the second, since you answered Part B of 1, we've been asked to look at various control groups, either historical or using specific references as controls in clinical studies.  You also make the distinction between standards and reference.  To what degree can, in fact, one use the present reference in making judgments, value judgments for clinical studies and making comparisons between feeding groups that might be placed on new formulas and the current CDC reference?  Is it sufficiently robust to be used as a standard in making that clinical judgment that control groups normally play in clinical studies?

    DR. GRUMMER-STRAWN:  I think the question is one of what do you want to compare to.  What is right?  In comparing to these curves, you are implicitly saying I am comparing this child's growth or this group of formula-fed infants' growth to the way children have grown in the United States essentially over the last 10 years, maybe 15 years.  Are you comfortable with that kind of a comparison?

    You would say based on formula--in assessing a formula, you are saying this particular formula that we are evaluating generates a pattern of growth similar to the way children grow in the United States, whether they are formula-fed or breast-fed, fed on all kinds of different formulas, mixtures of solid feedings coming in at various ages.  If you are comfortable with that kind of a comparison, this formula produces a pattern of growth like children in the United States, like a cross-section of all the children in the United States, then I say yes, this is a perfectly fine comparison to make.

    If you want more of a prescriptive statement that this formula produces a pattern of growth that is the most healthy, I don't think that you can say that.  I don't think that we can infer that a cross-section of infants from the United States with all of the variety of primarily infant feeding patterns--there are also varieties in terms of socioeconomic status, the kinds of conditions children are running up against.  I think that in terms of the impact of those on these curves, those are not as great.  But the variation in feeding patterns, I don't think that we can say that we have the best pattern of growth here.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  Thank you again.

    We'll move on to the next paper.  I don't think Dr. Fomon needs an introduction.  I can't think of a more senior person in pediatric nutrition than Dr. Fomon, and I'm very pleased that he was able to join us today and was willing to leave lovely Texas for Washington for this purpose, from one native Texan at least.  And he will be addressing the Iowa data and the Iowa/Fels growth data.

    Thank you very much for joining us, Dr. Fomon.

    DR. FOMON:  As the most senior member of the presenters, I'd like to make a statement for the presenters that we were asked to prepare a 20-minutes presentation, and if we run over and get the zero sign at 15 minutes, we're not very apologetic.

    [Laughter.]

    DR. FOMON:  First I want to get out of the way what are the Iowa data and what are the Fels data, and then we can talk about more interesting things.  The Iowa and the Iowa Fels data, Iowa Fels data is published by Guo, et al.  Term infants measurements all made with highly quality controlled efforts.   Caucasian, we were able--and I will show you that--no, but it's in my paper--that the length and weight of the Fels and the Iowa series were very similar at three months, which gave us some encouragement in combining them.

    Iowa data.  There are other Iowa data, but I wanted to first speak about 8 to 112 days.  These were all formula fed infants, 380 males, 340 females, and mostly infants of upper socio status, but not necessarily economic, because they were mostly UI personnel, University of Iowa faculty and students.

    They were measured within 4 days of 8, 14, 28, 42, 56, 84 and 112 days.  No exceptions.  And the age of measurement by interpolation or extrapolation was made to the target age.  So we adjusted if a baby was measured at 16 days.  We used the 28, 16 and 8-day measurements to compute a 14-day measurement.  It was fully longitudinal.  There is no one of those 380 males or 340 females who was not measured at every time.  This is published, so you can read about it in books.

    Then we did also measure a number of infants from 112 to 196 days of age, and they were 165 males, 188 females, and there was a subsample of these that were also in the other group that I talked about.  There were 63 males and 74 females who went from 8 to 196 days.  We have many more now, but it's too expensive to do the analysis.

    The Fels data, there are a lot of Fels data.  The Fels data that I'm talking about are the Fels data that are included in the Guo, et al. paper, and that included 240 males and 236 females, wide range of socioeconomic status, few measurements during the first 3 months.  The target ages were 1, 3, 6, 9, 12, 18 and 24 months.  Most of the children were measured within 3 weeks of the target ages, and there were some missed points, and they arrived at these ages by mathematic curve fitting.  And there are other people who know a lot more about that here than I know.

    So the Iowa Fels data, we ignored the Fels data during the first 3 months, so it doesn't matter whether they were breast fed or formula fed.  There were, for the first 3 months, just Iowa data, 580 males and 562 females.  The reason that's bigger than the numbers I showed you before was what I showed you before was formula fed.  We included breast fed in this, and I'll tell you why.

    Then we used both Iowa and Fels data, having established that the Iowa Fels size at 3 months was very similar.  We used both for 3 to 6 months of age, and then we used only Fels data for 6 to 24 months of age.

    So those are the Iowa data and the Iowa Fels data.

    The Iowa data and the Iowa Fels data and most other referenced data, except those for international comparisons, have been developed to detect abnormalities of infants, of individual infants, and to detect abnormalities of individual infants you have greatest interest in the outlying centiles.  The more individuals you have, the more confidence you have in those outlying centiles. What you want is early detection of growth abnormalities.  Growth will not tell you whether a child is normal or not, but it gives the most important single clue to telling you that that baby is one that needs closer attention than the general garden variety baby.

    And for that reason, weight gain is more important than length gain, and that's because it gives you this clue much earlier than change in length.  I think I have that on the next slide.  And you need data for at least the first 2 years.  Iowa data are only good, up to at best, 196 days.

    I have to go back, but I don't know how to do that, so I'll tell you that--can you go back?  Just go on back.

    The weight gain is more important than length gain because as I said, weight gain gives you the clue earlier.  Length gain is very difficult to measure accurately enough to be useful for determining changes in length the way it is done in hospitals, clinics and doctors' offices.  So length is not really a very feasible way, and moreover, there are very few instances in which length gain will be abnormal and weight gain normal.  So weight is the most important thing.

    Now, when you evaluate an infant formula--we didn't think about this until much after we published most of the Iowa growth data--you have different criteria for what you need as reference data, and the characteristics are it should be longitudinal.  It's difficult for me to agree that you can use cross-sectional data as a sensitive way of analyzing longitudinal data.  If you're doing an infant formula study, you're examining how the infants grow over the period of study.  For that you need longitudinal reference data.  It should be gender specific.  I haven't heard any argument about that.

    The study integral must include at least part of the neonatal growth spurt, should include all of it or most of it.  Neonatal growth spurt is from 8 to 42, maybe 8 to 112 days of age.  After 112 days of age growth rates are substantially less, and we'll come back to that.  And you need length data as well as weight data because it is possible that you would find--and I'll give you an example of this--a situation in which babies would grow normally in weight or maybe super normally in weight, and the weight to length would be outside of what we see with usual infant formula, suggesting that maybe this formula is not fully adequate.

    The reference population should be similar to the study core, and that's always going to be a problem.  It will never be exactly like the study core, but the question is, how close can you get and how close do you need to get?

    I missed the last one.  If I knew how to use this, probably I'd know how to use my camera too.

    [Laughter.]

    DR. FOMON:  The duration of study should be at least 84 days.  That's a new minimum length.  I made up the old 3 months minimum length that's in the AAP report, and I didn't have any good basis for that, but I think that maybe you can agree to 84 days as well as you can agree to some slightly longer figure, and we'll come back to it.

    I said this.  The most sensitive evaluation of the longitudinal growth study of a cohort in the longitudinal growth study requires longitudinal reference data.  I don't say that this is gospel.  It's just what I believe. Gains in weight and length are more rapid in infant males than in infant females.  The formula may be adequate for females but not for males.  Nobody's arguing about the gender anyway.

    The study integral must include at least part of the neonatal growth spurt.  A formula may be adequate for older infants but not for younger infants.  The reason is that during the period of most rapid growth the ratio of specific nutrient  to energy is highest, and if you get beyond that period, the ratio of protein or calcium or whatever to energy may be down at a lower level.  So if you start a study at 4 months and you get 4 or 5 months of additional data, the formula may be fine, but it doesn't tell you that it will be fine starting at 8 days or at birth.

    Here is an example. These are males, and this is weight gain from 8 to 56 days, and we did a study of a relatively low protein diet based on isolated soy protein with or without a methionine supplement.  We were interested in getting a fix on the requirements for sulphur containing amino acids.  And with the methionine supplement the gain was 42.3 grams per day, and with no methionine supplement it was 38.8 grams per day.  Compared to the reference data, 8 to 56 days, this was not significant and this was significant.  From 56 to 112 days there was no difference.

    I wish I had 3 or 4 more studies to demonstrate this, but this suggests at least that you need to have an early portion where you have the maximum postnatal growth included in your evaluation of a formula.  You need data on length as well as weight, and this was the best example.  I think if I spent more time I could find more examples and then it would be more convincing, but I couldn't do that, because I had to work in my yard.

    [Laughter.]

    DR. FOMON:  These are males and this is the BMI, and this is a low-protein formula and this is the reference.  And at 8 days, when we enrolled them, the low-protein, the cohort receiving the low-protein formula had a BMI of 13.  That was significantly less than the reference data.  At 112 days the low-protein cohort had a BMI of 18.6 which was significantly greater than the reference data.  And all our data, including--if you pardon my expression--skinfolds, indicated that these babies were fat.  And what we speculate is that they were--and they took more volume.  We, in all our studies, record how much the babies eat.  We weigh the bottles in and the bottles out, and they took more energy in and they gained more weight, and we speculated that they ate more because of the low-protein content.  They were eating--I don't know how they knew how to do it--but they were eating more to make up for the low-protein concentration in the diet.  They got enough protein.  They grew normally in length.

    The study cohort must be similar to the reference cohort.  They should be healthy, that is they shouldn't include babies with illness.  They should be term if what you want to know is, is your formula going to be adequate for term infants.  You don't want to increase the noise in your experiment by adding preterm infants.  And then the question of ethnicity, terribly important in international studies, may not be so important in the United States where you're comparing what seems to be similar groups, but that's a question that needs to be carefully examined for each study.

    So the duration of study should be at least 84 days.  I think that 8 to 112 days, which is why our data are mostly 8 to 112 days, or 14 to 112 days, almost as good I think.  At 8 days, there are great advantages of 8 days.  Most formula-fed babies have regained their birth weight.  You can get a really good measurement of length at 8 days.  You're not there to get it when the baby is born, and even if you are there, the hardest time to get an accurate measurement on a baby is at birth.  Howard Meredith, many years ago, showed me some publications on how very hard it was to get an adequate length at birth.

    Now, I think that in the current U.S. climate, where you can hardly recruit any formula-fed babies before 42 days of age, which is still fairly easy, but you may be able to recruit them at 28 days.  Later than 28 days you miss too much of the postnatal growth spurt.  Earlier it's too hard to recruit, so maybe 28 days is the most feasible, and 28 to 112 is 84 days, and that's how I came to 84 days.

    Now, if I give you my recommendations to the FDA, straight out, no hedging, I would say that size data are not relevant.  I mean I know half the people in the audience hate me.  Size data are not relevant.  Data over 6 months of age are not relevant.  I'm not even sure that data over 4 months of age are relevant.  Breast-fed babies are not relevant, but that's not so serious because during the period we talk about they gain about the same.  I just don't like to muddy up a study of formula-fed infants with a mixed group that I can control.  And unless you have--you must match the cohort with the reference group.  If your reference group consists of term infants, then you can't muddy it up by including infants, preterm infants in any number that they might be present.

    So those are my messages and I'm willing to take the flak because I've done it before.

    DR. GARZA:  Thank you very much, Dr. Fomon.  I regret the miscommunication between FDA and the speakers.  We'll try to get that resolved for you in terms of the 20 versus 15 minutes.

    Are there any questions?  Dr. Anderson?

    DR. ANDERSON:  Anderson.  I understand that the Iowa Fels data are longitudinal data.

    DR. FOMON:  Absolutely.

    DR. ANDERSON:  And that the recently published CDC standard is largely from cross-sectional data.

    DR. FOMON:  Absolutely.

    DR. ANDERSON:  To what extent do the percentiles generated from the two sources differ in substantive ways?

    DR. FOMON:  I don't know that offhand.  I think from the point of view of infant formula evaluation, that it's not relevant because you can't get good analysis of longitudinal data from a cohort under study by comparing with size data.  In 1976, just to prove my seniority, I had a dialogue with Peter Hamill [ph] over about 9 months, trying to convince him to call the NCHS charts size charts so that people wouldn't be confused by thinking that they're growth charts, but I lost that argument.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  Dr. Fomon, I'm interested in your opinion of a couple things.  One, the need for a control group, a contemporary control group when you're actually doing a study, you know, particularly for the kind of things we would be looking at, a change in formulation.

    And secondly, when you said that you thought the breast-fed baby didn't have any role in thinking about evaluation of formula-fed babies, I think rather than is breast feeding--the growth pattern of breast-fed babies a pattern that should be strived for with formula-fed babies?

    DR. FOMON:  Well, let's see, question number 1.  Tell me question number 1 again.

    DR. STALLINGS:  About a contemporary control group.

    DR. FOMON:  I think there are circumstances under which a concurrent control is essential, and one was the study that I mentioned to you, where we had a formula, a low-protein formula fortified with methionine or not fortified with methionine.  In that case we had to demonstrate that with methionine it maps the reference group.  But I think in general it's probably not really essential if you have good reference data for comparison, don't think it's really essential to have a concurrent control.  It greatly increase the number.  The number that you need to compare one cohort with the reference data is considerably less than the number that you need if you're going to compare it with a concurrent control.

    So just from the practicality of making it possible at all to study new formulas, I think it's a reasonable compromise not to require a concurrent control unless something about the ingredient change or whatever suggests that a concurrent control would be valuable.

    And on the other question, should a cohort fed a new infant formula be compared to a breast fed control reference group, I think that's more a philosophic matter than a scientific matter, and my own conclusion is that if you want to study a new formula, you should study it in comparison with old formulas, and not with some group that we think might represent ideal growth.  That's just what I think.  Other people think other ways.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  Thank you very much.

    It is 5 minutes to 12:00.  We're going to try to make up the 20 minutes from this afternoon's schedule because we've been running a bit over time in the presentations, so that rather than coming back at 1:35 as the schedule suggests, I'm going to ask people to come back at about 1:10.  So that we can make sure we can started by that time, having everybody here at 1:00 o'clock would be ideal.  So we will ask the Committee to reconvene at 1:00 and we'll get started after that as possible.

    Lunch for the Committee and the speakers is in the room where you had coffee, and the cafeteria is available to everyone else.  1:00 o'clock.

    [Whereupon, at 11:57 a.m., there was a luncheon recess.]


A F T E R N O O N   S E S S I O N

(1:07 p.m.)

    DR. GARZA:  We can get started.  Our next speaker--can I have all our guests please take their seats?  I think the Committee is at the table.

    Our next speaker is Dr. Jon Tyson, who is a professor of pediatrics, obstetrics, internal medicine and epidemiology at the University of Texas Medical School in Houston and the School of Public Health.  Dr. Tyson will not be able to stay with us tomorrow, so I want to make sure that the Committee members ask all questions or clarify any outstanding issues that they might have related to the topic of growth data for preterm infants, because in fact Jon will be leaving soon after his presentation.  So it's important that you try to get your questions to him before he returns to Houston.

    Jon, thank you very much for coming.

    DR. TYSON:  Thank you very much.

    Well, as a neonatologist and epidemiologist, I'm going to try to make the case today that if what you mean by normal growth is desirable or healthy growth, that the evaluation of early growth in preterm infants will necessarily involve evaluation of health and development.  I'm going to try to go through this in a orderly fashion to promote a rational and evidence-based decision making.

    The first question is:  can the growth rate sustained by a new formula be adequately assessed using published growth norms?

    This is a growth curve from the Neonatal Research Network, observed rates of growth, are weight--sizes, Dr. Fomon would say--in babies according to their birth weight.  Throughout their hospital stay there is serial information assessed.  Also in length and head circumference and mid-arm circumference for these babies.  There's no data for babies greater than 1,500 grams and no data beyond discharge.  I would add that there in my little handout, there's a website that you can go to where you can, for an individual baby, enter the measurements at birth, and print out a growth curve for that particular baby.

    For preterm babies beyond discharge, this shows what I think is the best available data from the Infant Health and Development Program which had 985 preterm babies, and it provides data for 3 groups:  2,000 to 2,500 grams, 1,250 to 2,000 grams, and less than 1,250, plotted according to post-conceptual age--I think that's really post-menstrual age--up to 3 years of age.

    Now, do these both norms describe normal growth?  And I think we have to be really careful what we mean when we say normal values, because I think that's a term that often causes confusion.  I actually try to avoid that word.  I see it used sometimes as referring to values that are expected or typical, typical values, values that are not associated with adverse outcomes or low-risk values, values that do not cause adverse outcomes, healthy or optimal values, and values for which intervention has not been demonstrated to be beneficial.  I think it's really important that we keep clear what we're talking about.  At best these growth grids describe what are typical values.

    How should they be used?  I think that they're a useful clinical tool to assess the growth pattern of individual babies.  Whether they are appropriate regulatory standard to evaluating the formula, I'm sure the answer to that is no.  They provide neither the optimal values nor an adequate basis to compare the growth with new formula to conventional formulas.

    As I've spelled out in the handout, if you compare the growth for the new formula to one of these growth grids, what you describe as an effect on growth might be due to any of a large number of factors, including intervening changes in care and outcome since the growth grid was developed, the fact that in most studies you're using selected patients, whereas the growth grids are based on all patients; a myriad of differences between centers, and the opportunity for bias in patient care, selection, care and assessment in evaluating new formulas.  And I think it's particularly important when the sponsor or the investigator has a financial or even a professional interest in the outcome of the studies to attempt to avoid bias.

    I think we also have to ask ourselves whether the statistical tests that are commonly done in evaluating growth studies are misused and then misinterpreted in assessing interventions using historical controls.  The babies in feeding studies and the babies in these norms that we see for preterm babies and for term babies, are clearly not a random or even a representative population of the sample of the same population.  Moreover, a p-value of less than .05 is often taken to mean that the difference is due to the intervention, when it may be due to any of a number of factors a difference in population, differences in the way the populations are assessed, et cetera.

    So I think we should be asking the question, why use historical controls at all to evaluate new formulas for preterm infants or for term infants as well.  For every other intervention that we talk of in medical care, the randomized trial is the gold standard.  The concurrent cohort, carefully done, carefully studies, is a silver standard.  Historical control is a bronze standard.  Why do we want to use the bronze standard?  And these are also issues in concurrent controls as well.

    Should carefully designed randomized trials be required?  I think the answer to that is clearly yes with a number of other features designed to minimize random error or systematic error and increase the signal-to-noise ratio, which would include mass caregivers and evaluators, well-standardized evaluations shown to be reliable by the people who do the assessments in the study, effective procedures to avoid attrition, and intention to treat analysis predefine stopping rules in an adequate sample size.  And I would add to that I think a commitment to publish the data at least on the website, no matter what the data show.

    Now, there has been opposition to the use of clinical trials by formula companies that would be charged with this responsibility, and I think part of that opposition is the expense and the feasibility of such trials, and I think with the progress in organization of research effort, that this is more feasible now with lots of neonatal research networks out there, some that already include follow up evaluations in at least some if not all the centers, and by the recognition that you don't--you need only do simple management trials.

    Most people, when they think about randomized trials are very expensive, are thinking about the usual traditional kind of explanatory trial, which are designed to determine whether therapies work in ideal or restricted circumstances, or that are designed to define the mechanisms of action.  Management trials, on the other hand, or so-called effectiveness trials, are designed to determine whether therapies work under routine clinical circumstances, so all the effort that goes into trying to control all the co-interventions in explanatory trials is inappropriate in a management trial.

    Who should be enrolled?  Who should be excluded?  I think you want to enroll representative sample of the babies for whom the formula is intended or at least the highest risk group.  So you want to include any babies who make up an important part of that population, small for gestational age babies, very sick babies, twins, et cetera.  You would want to exclude relatively few infants, say the babies who have major congenital anomalies or overt nonbacterial infections.

    Should infants fed their mothers' milk be included?  I don't think it's absolutely necessary, but I think it's highly desirable to increase the generalizability of the results of the trial because a large proportion of all preterm babies are fed at least some of their mother's milk, and also to help identify limitations of the formula, and areas for potential advances based on the benefits of mother's milk over formula after adjusting as best feasible for other factors.

    What assessments should be performed?  I think, obviously, body composition or biochemical, physiologic or functional variables need be considered, but most of the time those would have been studies in prior explanatory trials.  There might be some need to get some of that in some of the patients though.

    What about health outcomes?  Well, I think one variable that has to be considered is the percent of infants with necrotizing enterocolitis.  This is a serious disorder with a mortality exceeding 50 percent in surgically treated extremely low-birth weight babies, and it may well be related to feeding.  Death is a competing variable for necrotizing enterocolitis.  You have to live long enough to get NEC, so you would need also to look at the composite outcome of death or necrotizing enterocolitis.  We also have to be worried when we feed babies with chronic lung disease that rapid growth may not be attainable or even desirable.  If you have marginal pulmonary sufficiency, how fast do you want that baby to grow?  So you would like to know about the combination of death or prolonged mechanical ventilation.

    Neuro developmental outcomes I think are at least highly desirable if not mandatory, because first growth and development may be differentially affected.  You need to exclude adverse outcomes on development, even in the presence of good growth rates.  I think also we need this information to better define the optimal growth rate and the appropriate goals for growth rates sustained by formulas for infants with or without serious illness.

    While growth assessments, weight, length, head circumference and weight-length ratio I think are essential, there may be others as well.  What minimum period of assessment is needed, I think we should remember that a reliable identification of major neural developmental impairments is probably not possible any earlier than 18 months adjusted age, that is, post term, and this would allow evaluation of potential late effects beneficial or hazardous on time-limited interventions given in the NICU or later.

    What standard should be used in judging the growth of preterm infants fed new formula?  The American Academy of Pediatrics has said that the goal should be to achieve rates in an extrauterine environment like those that would have been achieve in utero had the baby not been delivered early.  But we have to wonder if this is really the right goal.  We have some uncertainty about what this rate is currently and we can discuss why.  If we can only measure gestational age well, we could do this better.  Currently we think that it's about 15 to 17 grams per kilo per day weight gain, about 1.1 centimeters per week in length, and about .7 centimeters increase in head circumference.  We again need to ask is this an appropriate goal for infants with severe lung disease, and we also had this observation of persistent growth deficits after reaching full feedings.  So even though we can get babies to grow rapidly, once they get to full feedings, there's this long period of time when they're growing poorly as they recover from illness after birth, and this shows you data for the neonatal research network for babies of different gestational ages, 24 to 25 weeks, 26 to 27 or 28 to 29 in relationship to a so-called growth grid that Alexander published.  And you can see that the babies don't do that bad once they start growing and taking a full intake, but they end up with most of them smaller than the 10th percentile for babies developing in utero with the same gestational age.

    Whatever goal we try to take we have to think of it as provisional, but we could ask, should the current standard for judging preterm formulas be the formula that sustains the best catch-up growth, and that could be that the weight, length and head circumference and the body proportions would be most like that of term infants of the same adjusted age, providing there were no adverse effects on the health or development through 18 months as identified in a well-designed trial.

    How many infants would you need to study to assess a new formula?  This is a really complex and important generic issue in assessing intervention, any intervention where there may be an uncommon but serious potential hazard like necrotizing enterocolitis, and I'm going to spend some time on this even though this may seem to you like a statistical issue only, I think it's an important practical issue, because the kind of things I've said would make feeding studies so large that formula companies or indeed the NIH may be unwilling to fund these.  So I want to try and see if we can find some way to address this.

    So somebody may say, well, wait a minute.  The old formulas have not actually been tested that well, and I've got a new formula here that has strong a priori evidence and rationale for using it.  Say it has a component that's provided before birth across the placenta and in human milk after birth, not given in prior formulas.  It's not well synthesized from precursors in preterm babies.  And we think it's important for healthy development.

    I think even in that circumstance you still need to rule out the possibility that there are important unrecognized hazards of this formula and I'm going to try to list what I think those are or the most important.

    The first would be an absolute increase of at least 3 to 7 percent or more in major adverse neonatal outcomes, particularly necrotizing enterocolitis.  A 3 percent absolute increase corresponds to what's called the number needed to treat of 33.  That is, for every 33 babies that you this formula to, you would cause one baby to have necrotizing enterocolitis.  I think that would be unacceptable even if all the other babies benefited in growth or perhaps even in development.

    A second would be a reduction in developmental quotient at 18 months of a quarter of the standard deviation or more.  That's the mean developmental quotient of 18 months and reduction of a quarter of a standard deviation or more.  If you observe that, that would substantially increase the number of preterm infants with a deficient or marginal IQ that would be eligible later for educational intervention programs.  In the neonatal network this would correspond to a reduction of almost 5 points on either of the Bayley subscores.

    Third would be a reduction of a quarter of a standard deviation in length or head circumference at 18 months.  And this of course is arbitrary, but at least after recovery from serious illness, there's no apparent benefit of slow growth so that I would think a modest decrease in length or head circumference, if not weight, could be seen as presumptive evidence of harm.  And this would correspond in the neonatal network in 18 months to about 250 grams in weight, 1-1/4 centimeters in length and a half a centimeter in head circumferences.

    Now, you may think that this is too small to look at, but I would reassure you that the sample size needed to assess necrotizing enterocolitis, if you use a sample size that's large enough for that, you can evaluate very small effects on growth.

    Now, if you take a conventional approach to sample size, you would need 315 per group to have 80 percent power to identify a quarter of a standard deviation difference in either development or growth or size at 18 months, and an alpha error of .05, assuming you lose fewer than 20 percent of kids to follow up.  The power to identify an increase in necrotizing enterocolitis would be 78 percent, for a large increase, 7 percent, that would be a doubling of the right of necrotizing enterocolitis in the neonatal research network.  It would fall to only 22 percent for a 3 percent increase, so a really small power to look at a clinically important increase.  If you said you'd like 90 percent power to identify a quarter of a standard deviation difference of 18 months, you'd need 421 per group.  Your power to identify an increase in NEC would still be only 30 percent for a 3 percent increase in NEC.

    What can you do about this?  One potential way to address this is a non-inferiority trial.  For the sake of time I'm not going to talk about that.

    Another, and I think this is, practically speaking, the most attractive option, is to increase the p-value considered statistically significant in evaluating a serious hazard.  As you know, the same p-value, usually p less than .05 is used for benefits and hazards in studies, and this is an arbitrary and not well justified practice.  For a serious hazard like necrotizing enterocolitis, a higher p-value might be justified on multiple grounds.  First is, we know in clinical studies the direction of bias is toward finding benefits rather than looking for harms.  There's a lot more effort put into it in general, and the studies are powered to evaluate benefit rather than harm usually.  But the hazard may be much more important than the benefit.  And as pragmatic evidence, we know that data safety monitoring committees that review the accruing evidence in clinical trials will stop a clinical trial at a much higher value of p for hazard than for benefit.

    The appropriate p value shouldn't depend in part on the cost of drawing the wrong conclusion.  For a serious hazard like NEC, I would contend that we might select a p less than .30.  That would still result in a 70 percent chance or higher that a difference of that magnitude would not occur by chance under the null hypothesis.  If you did this, what you're doing is you're increasing the risk of a false positive conclusion, that is, that you would conclude that the formula causes NEC when in fact it doesn't.  In order to reduce the risk of a false negative conclusion, that is, a conclusion that the formula doesn't cause NEC when in fact it does.

    So if we go back to the numbers we calculated before for benefit, at 315 infants per group, again, that was for 80 percent to look at a .25 SD difference, the power to identify an increase in NEC would be 96 percent very high power for a large increase, 58 percent for a 3 percent increase.  So you're slightly more than 50 percent likely to identify it.  If you use 421 per group, you then get down to a power of about 2 in 3 to identify a 3 percent increase.

    If you found hazards at a p of .30 and benefits at a p less than .05, what would you do?   Well, I think you wouldn't recommend the formula, you'd recommend further study, and that would be a departure from what has been done in usual practice.

    I'm going to skip that one, and just conclude by saying that I hope I've convinced you that the growth of preterm infants should not be assessed in isolation from effects on health and development, that a large trial evaluating growth health and development to 18 months or more is needed to assure that the benefits of any new formula outweigh any hazards in preterm babies and to better define the effects of different growth rates, and the growth rate that we should be looking for in deciding how to design preterm formulas.

    Thank you.

    DR. GARZA:  Thank you very much.  Questions or comments?  Dr. Stallings?

    DR. STALLINGS:  It sounds like you've silenced us pretty well.  You know, we at the beginning, talked about dividing up preterm from term infants completely, and it sounds like, from your presentation, that you really, in the area of growth, that there really isn't anything you learned from term studies that would influence you on preterm.  Would that be a fair--

    DR. TYSON:  Well, I wouldn't say wouldn't influence you, but I don't think you can determine whether a new formula is appropriate for a preterm infant based on observations in term infants.

    DR. STALLINGS:  The other thing I'd like for you to elaborate on a little bit is I think it's often that--you were telling us a bit about who the sample should be, and in essence, the inclusion/exclusion criteria.  Which infants, if you elaborate, which infants should not be in a growth study?  Which preterm infants should not be in a growth study of preterm?

    DR. TYSON:  Well, I think it would be babies with the kind of problem that's very unusual, and that's going to have an overwhelming effect on growth like trisomy 13, growth and mortality, nonbacterial infection.  Other than those things, I think you're talking about 3 percent of babies or something.  The rest of them I would vote to include.

    DR. THUREEN:  Thureen.  Dr. Tyson, I know that in your paper you said you would include growth restricted infants as part of this because they're such a large portion of the population, but that you would substratify those infants for further analysis.

    DR. TYSON:  Yes, right.  You can of course include explanatory evaluations within a management trial, so it might be that that formula has a different effect on those babies.

    DR. THUREEN:  In terms of neuro developmental outcome, do you think it would be fair to exclude infants who had had very high risk factors for significant neuro developmental outcomes, such as intracranial hemorrhage, prolonged asphyxia, evidence of white-matter disease, before the trial even started?

    DR. TYSON:  If the formula is going to be fed to babies with severe asphyxia, then I think you would want to test it in those babies.  For some of those conditions you had mentioned, they would occur after you started the feeding, so like cystic white matter disease you might not identify till 36 weeks post conceptual age or something, and that's really, that's potentially an outcome variable.

    DR. THUREEN:  Would you pair match those infants then at all with other infants who had similar risk factors or known disease that affects neuro developmental outcome, or would you just do a purely prospective randomized trial?

    DR. TYSON:  If you do a large randomized trial, first of all, it gets really cumbersome to try and pair anybody at birth.  As long as you're stratifying by center and maybe a couple of other things like birth weight less than 750, 750 to 1,000, something like that, that you will end up with an approximately equal number of those babies in the two groups, and then you can go back and do an analysis.  If you try to stratify for birth weight, SGA, gender, birth asphyxia, et cetera, you end up with so many huge strata that the study gets really complicated to do.  And I don't feel as strongly about that as most statisticians, but my understanding of the school, most statisticians are towards the minimal prognostic stratification at randomization, and more toward post hoc looking at individual groups who should have been predefined ahead of time which group you were going to look at.  Does that answer your question?

    DR. THUREEN:  That makes sense.  And would you change any of your ideas about how to conduct a study if you are going to look at patients who this is their exclusive formula fed from initial feeding versus studies started when infants really attained full feeding?  Do you think that it makes any difference on how you conduct the study if you're looking at those two issues?  Did that make sense?  Because yours sound like you're referring to infants who may start minimal enteral feedings with the study of formula, rather than waiting until they attain full feeding and then starting from that standpoint?  Do you think it's preferable to do one or the other, or do all of your idea really apply to--

    DR. TYSON:  It depends on when the formula is going to bed.  If it's a formula that's going to be fed in the first week or something like that, I think you want to test it as it's going to be used.  Let's say that it was a formula that was recommended for us from the first feeding.  Let's say for the sake of argument that it cause necrotizing enterocolitis, and you didn't enroll baby, and you didn't start that formula until 3 weeks of age or something, or at a point when the babies were on full feedings, you might miss that effect, so you want to test it as it's going to be used in the real world.  Does that make sense?

    DR. THUREEN:  Yes.  And then lastly, do you believe that there are really no good reference standards for growth in the preterm infant or at least a certain subgroup of preterm infants that may be extremely low birth weight?

    DR. TYSON:  In the Neonatal Research Network, we have research nurses that are doing standardized--that have done standardized evaluations of anthropometry in intervals.  There are huge center differences.  If you try to take the data from any center to apply it to another center, you could easily be mislead just by the center differences.  So I don't see why you would want to use comparisons that would involve center differences or time differences.  This was gathered data 3 years ago when they were using steroids, postnatal steroids more often or less often than they are now.  Why not randomize and get the cleanest--I think the belief that you don't have to use controls, you don't have to use randomized controls, that you can answer the question with fewer patients is an illusion, that at a given number of patients your ability to get a unbiased answer to the question is going to be greater with randomized controls than with historical controls.

    DR. THUREEN:  Thank you.

    DR. GARZA:  Heubi?

    DR. HEUBI:  I think, Jon, this is all very interesting.  I wanted to actually ask you a couple of questions, and you'd have to follow my line of thought here.

    The number of subjects that you would entertain as being appropriate for a study is about 10 times what a typical current formula study would be.

    DR. TYSON:  Right.

    DR. HEUBI:  With that in mind, knowing what you knowledge is of the Neonatal Network, would the Neonatal Network sponsor studies like this because this is specific to preterm infants and it would be a potential rationale to study with partnering between industry and NIH money to do studies like this?

    DR. TYSON:  The Neonatal Network has a protocol review committee and standard procedures for--there's no reason that couldn't be proposed and seriously considered or accepted if it went through all those things.  There are lots of networks out there.  In Texas we've started a Texas network, and there's an Oxford network, and the Canadians have a network, and the Australians have a network, and I'm sure there's networks developing in the United Kingdom if they're not already in place.  So there are a lot of people willing to do this.  The hardest part is going to be the 18-month follow up.  That's a lot harder than studying NEC, but as more and more people say, "If I'm going to take care of babies this size, I need to know how they turn out.  That means I have to have a really well functioning follow up system.  So I think there are going to be people out there that can do it at much lower cost than if you just went to them on day one and said, "We're going to fund your whole follow-up effort in order to answer this one question.

    DR. HEUBI:  But I was looking at it from the standpoint of it being economically more attractive to industry to do studies through the network that exists through the NIH because some of the infrastructure already existed and was already being paid for in part by federal money.

    DR. TYSON:  Right.  Kathleen Kennedy and I proposed to the network a feeding study, and one of the things--and I'm glad I was involved in this effort--the business about the p-values that I presented today, that was aware stumbling block to us when we got to the--the reviewers really liked it and the statisticians said, "Well, you're going to have to study 6,000 babies or something," and I was working under the same mindset I had been before, well, that's if you want to look at a p less than .05.  But why not accept a higher p-value, or as I was going to say on the last slide, predefine what you think is an acceptable ratio of the number of babies who benefit to the number of babies that are going to be harmed, and then test that and say, does the number of babies who are helped by improved growth or development,  relative to the number who are harmed by NEC or worsening BPD, if indeed that's a hazard, is that an acceptable ratio or not?  And try to design studies not to look at one outcome variable but at the relationship of one or two variables or perhaps even more.  And I think that's a cutting edge issue in the experimental design that the time is ripe to do now.

    DR. HEUBI:  And this is a circumstance where clearly DSMB or some monitoring board, during the--

    DR. TYSON:  Yes.

    DR. HEUBI:  --would be pretty accepted.

    DR. TYSON:  Right.

    DR. DENNE:  Jon, you've made an argument about following preterm infants out to 18 months for neuro developmental outcome and I understand the rationale for that argument.  How do you feel about similar studies in terminants?

    DR. TYSON:  Do you mean randomized trials or--

    DR. DENNE:  No.  I mean the necessity for evaluating neuro developmental outcome at 18 months in studies of new term formulas?

    DR. TYSON:  I don't see why not.  I mean everybody in this room thinks nutrition's really important.  You take these formulas and you feed them to millions of babies.  Why test it in only 50 or 60 babies?  Why allow it to go on the market without knowing does it have beneficial or harmful effects as far as you can tell with an appropriate sized study in term babies?

    DR. ANDERSON:  Anderson.  In your discussion of NEC much of the calculations were done based on a baseline rate of 7 percent.  How would you feel about some of the adverse event monitoring being done not necessarily in the context of a randomized clinical trial, but against some fixed standard?  That is an infant formula would be unacceptable if it produced a rate of NEC above 10 percent.

    DR. TYSON:  There are several problems with that.  One is there is a lot of institution variation in NEC and with any institutions, there are periods when the NEC rate really goes up, and when it comes down, nobody quite understands that.

    And finally you have the potential problem of bias.  There have been studies, for example, they took x-rays of kids with NEC and x-rays of kids thought not to have NEC, going to every pediatric radiologist in California, and they found this incredible variability in what was called NEC and what wasn't.  And so when you're in the context of a study like that, I just think the opportunity for bias is there, and that your ability to relate that to some other institution in a different point in time I just don't think is worth the effort.

    DR. STALLINGS:  Stallings.  A slightly different angle.  And we talked this morning a little bit about, or inferred a little bit about term babies growing too fast, and you know, that's of concern, but certainly for my clinical time, the worry over preterm babies growing too fast, and I can remember bedside debates about too fast and it's only fat and it's no brain and it's no muscle and all of that.  I don't think we have nearly as good a handle on the body composition component of the preterm babies.  And then you add to all of that the concerns that we're all reading more and more about, is early postnatal growth a very--I mean we know it's an important time, but a differently very important time in lifelong health.  Would you make a few comments?  I know you made the caveat about babies with really chronic lung disease and concern, and that's really related to CO2 retention, but put those babies aside, and can we grow preterm babies too fast, and how do we determine when we're approaching that?

    DR. TYSON:  Well, I clearly don't have the answer to those questions.  It seems to me the only way we can get it is to randomize babies to different feeding regimens that produce different growth rates and see who turns out to have the best h