Assessment in WIA Youth Programs |
||||||||
Youthwork Information Brief No. 24 |
||||||||
Cheryl R. Sturko Grossman, Michael E. Wonacott, and Diana Jackson |
||||||||
Why Learn about Assessment? |
||||||||
Assessment plays a key role in fulfilling the purpose of the Workforce Investment Act (WIA), which is to help at-risk youth make successful transitions to further education and employment in high-demand, high-wage occupations and careers. |
||||||||
In the first stage of a youth’s involvement in WIA program activities, an objective assessment is mandated to identify important baseline information about the youth: |
||||||||
|
||||||||
Assessment of WIA youth participants is not a one-time task, done and then forgotten. Rather, the results of assessment are used to design a specific program of WIA youth services, the Individual Service Strategy (ISS), that will address the youth’s barriers and needs and lead to individualized educational and employment goals |
||||||||
In addition, the results of objective assessment are a baseline against which the youth’s progress can be measured; they serve as a pre-test. As the youth receives the services listed in the ISS, ongoing assessment identifies gains and improvements in academic levels, skill levels, and developmental needs. If there are gains and improvements, the ISS should evolve to meet the youth’s evolving needs, ensuring that ongoing services remain appropriate to the youth. |
||||||||
Finally, statutory WIA Performance Measures and new Common Measures require periodic reassessment of youth skills |
||||||||
|
||||||||
|
||||||||
Initial Assessment and Objective Assessment |
||||||||
Following intake, WIA youth staff gather and document information to identify the youth’s barriers and to determine if WIA youth services are appropriate. Information is also gathered to help determine a youth’s needs for supportive services such as transportation, child care, dependent care, medical services, housing, and work clothing or tools. This process is initial assessment, a form of informal assessment, and does not initiate participation in WIA. |
||||||||
Objective assessment, on the other hand, does initiate participation in WIA and involves formal assessment. There are many types of formal assessments used in WIA youth objective assessment. The most familiar type of formal assessment is the achievement test. This type of assessment measures what a youth already knows or can do. Examples include a math test or a basic skills test. |
||||||||
|
||||||||
Another type of achievement test is a skill or performance test, which requires test-takers to demonstrate a skill or procedure, rather than just demonstrate knowledge about it. An occupational skills test is one example. |
||||||||
Other formal assessments measure traits, characteristics, interests, or aptitudes related to work and careers, like a career interest survey, a vocational aptitude battery, a life skills inventory, a learning style assessment, a measure of depression, or an intelligence scale. |
||||||||
The Ohio Department of Job and Family Services (ODJFS) mandates the use of specific basic skills assessments for objective assessment of WIA out-of-school youth; ODJFS also strongly recommends a specific instrument, TABE 9/10, for the required basic skills assessment for in-school youth. Local Workforce Investment Boards (WIBs) are free to select appropriate assessments for traits, characteristics, interests, or aptitudes related to work and careers. Assessments given by schools or other agencies up to six months prior to date of participation may be used for literacy/numeracy measures for both older and younger youth. |
||||||||
|
||||||||
Why Is Assessment a Challenge? |
||||||||
WIA youth staff may not be specialists in assessment, evaluation, or psychometrics and may find assessment to be a challenge. What is the right assessment to use? How should it be used? Who should administer it? How should it be administered? Who should interpret it? How should it be interpreted? How should it be used? |
||||||||
Fortunately, assessment developers and publishers provide a great deal of information and guidance for users of assessments such as WIA staff. The following are some key questions that should be addressed for each assessment to be conducted:
|
||||||||
|
||||||||
Is This An Appropriate Assessment? |
||||||||
Formal assessments are used to measure current skill levels, determine employment-related interests and aptitudes, and diagnose developmental characteristics and needs. |
||||||||
|
||||||||
|
||||||||
|
||||||||
Although WIA youth staff may administer a pre-screening assessment for learning disabilities (with appropriate training), only qualified specialists should make a formal diagnosis of disabilities. In other cases that call for formal assessment, WIA youth staff can use the following guidelines to select an appropriate assessment:
|
||||||||
Use a Criterion-Referenced Assessment, If Possible |
||||||||
Assessments are usually designed to be scored in one of two ways. |
||||||||
|
||||||||
|
||||||||
If possible, WIA youth programs should use criterion-referenced assessments to identify youth’s academic levels, including basic skills levels, and skill levels, including occupational skills. Criterion-referenced assessments provide detailed information on which specific skills youth have and which they don’t have. Norm-referenced assessments, on the other hand, only tell you how one youth compares to other youth – they don’t tell you which skills the youth has and which skills the youth lacks. So, norm-referenced assessments don’t provide the detailed information necessary to identify specific services needed or to measure skill gains over time. |
||||||||
Some assessments, like the TABE 9/10, are both criterion-referenced and norm-referenced. |
||||||||
|
||||||||
Use a Standardized Assessmen |
||||||||
An assessment is standardized when every test-taker has an equal opportunity to accurately demonstrate his or her knowledge or other characteristic being measured. Developers achieve this, in part, by examining individual items and the assessment as a whole to eliminate any possible racial, ethnic, cultural, linguistic, or socioeconomic bias. The assessment is normed against a representative sample of the target population to verify that items are not biased and to provide statistical evidence of quality. |
||||||||
Developers also provide standard instructions for administering the assessment. Instructions are often elaborate, including scripts that must be read verbatim. This ensures that all test-takers are receiving the same directions – if directions are ambiguous, at least they are ambiguous for everyone! |
||||||||
Instructions may also recommend adjustments in assessment procedures for youth with specific disabilities or for youth who speak English as a second language (ESL), along with statistical evidence of the quality of such adjustments. Finally, specific and detailed instructions are provided on scoring the assessment and interpreting results. |
||||||||
Use an Assessment with Evidence of Reliability |
||||||||
Reliability is a critical technical characteristic of a good assessment; an assessment is reliable if it gives consistent results – over time, across different administrations, or across different raters. Positive correlations (+0.6 or greater) provide statistical evidence of reliability. |
||||||||
|
||||||||
Use an Assessment with Evidence of Validity |
||||||||
Validity is another critical technical characteristic of a good assessment; an assessment is valid if it accurately measures what it is intended to measure. Assessments are only valid for the purposes specified by the developer. An assessment that is valid for comparisons between individual test-takers may not be valid if used to compare programs unless the developer has validated it for that purpose. |
||||||||
Expert judgment. The assessment should be a representative sample of the domain of knowledge and skills being assessed, and that is a matter of expert judgment. So, look for an assessment developed or endorsed by authoritative, acknowledged content experts in the domain. |
||||||||
Statistical evidence. Statistical evidence can also demonstrate that an assessment is valid. A valid assessment should show a positive correlation (as one factor increases or decreases so does the other) between +0.6 and +1.0 with:
|
||||||||
On the other hand, a negative correlation, between -0.6 and -1.0, would occur if one factor increases (for instance, a score on a coping skills assessment) as another decreases (for instance, aggressive behavior). Such a negative correlation can also be evidence for validity. |
||||||||
Logical and intuitive appeal. Youth sometimes may not see the connection between the assessment items and the knowledge or skills being assessed. If the items don’t have a logical, intuitive appeal to youth, youth may:
|
||||||||
An assessment administrator should explain the underlying rationale for the assessment to help youth accept its validity. |
||||||||
|
||||||||
Use an Assessment Designed for the Appropriate Target Population |
||||||||
Any assessment selected should be designed specifically to be used by youth like the WIA youth population – ages 14 -21, from a variety of racial, ethnic, cultural, and socioeconomic backgrounds, and with barriers to success. |
||||||||
For example, consider a basic skills reading assessment. If the assessment is written for adults, many items on the assessment may involve aspects of independent adult living – voting, purchasing a house, personal finances, parenting, and so on. A 16-year-old high school dropout might score low on the assessment because the content of the items is unfamiliar – not because he or she doesn’t have the required reading skills. |
||||||||
Finally, some assessments are available in both a short form and a long form; using the short form can help youth perform better. With a long form, youth can lose concentration and become fatigued – or even become so discouraged they exit the program prematurely, especially if they have had bad testing experiences in the past. |
||||||||
|
||||||||
Use the Same Criterion-Referenced Assessment for Pre-Test and Post-Test |
||||||||
A pre-test is used to identify a youth’s skill level at one point in time; a post-test identifies the youth skill level at a later point. The difference between pre-test results and post-test results is the increase in the youth’s skill – the skill gain. In order for those results and gains to be meaningful, the same assessment must be used for both pre-test and post-test. That assessment must also be criterion-referenced in order to provide specific details on skills gained. |
||||||||
|
||||||||
How Should This Assessment Be Administered? |
||||||||
Staff who administer an assessment must be familiar with the administration manual and follow explicit instructions carefully. The instructions are designed to ensure valid and reliable results. To avoid biasing assessment results, all youth must always receive the same exact, full instructions. |
||||||||
Format, Materials, and Equipment |
||||||||
Youth should have an opportunity to familiarize themselves with the format of the assessment and its items and any materials or equipment being used (calculator, scientific apparatus, computer). Computers are increasingly being used for assessment and may be an attractive option for youth. |
||||||||
Most youth are comfortable with computers, but keyboarding, mouse use, and reading lengthy material from a screen may all detract from the knowledge and skills being assessed. If computers are used, policies and procedures should be in place to deal with software, hardware, or power failures. |
||||||||
| Assessments typically provide sample items that test-takers can review, so administrators should go over those sample items to familiarize youth with format and equipment use. However, actual items should never be reviewed – sample items only! | ||||||||
Environmental Factors |
||||||||
Environmental factors can interfere with test-takers’ ability to demonstrate their true knowledge or skills so the physical environment should be controlled as much as possible:
|
||||||||
Administration procedures should be adjusted per assessment instructions to accommodate the needs of youth with disabilities or ESL youth; however, such accommodations should be made in assessment only if they have also been made during instruction. Additional information on accommodations for youth with disabilities is provided in TEGL 17-05. |
||||||||
|
||||||||
How Should This Assessment Be Interpreted And Used? |
||||||||
Assessments should be scored and results should be interpreted and reported as prescribed in the developer’s instructions. |
||||||||
Criterion-Referenced Assessment Results |
||||||||
Raw score. The basis for assessment interpretation is the raw score – usually how many items were answered correctly. The raw score provides only limited information; a score of 20 may be very good or very bad. |
||||||||
Percentage score. In order to make judgments about performance, raw scores are converted into percentages for criterion-referenced assessments. A percentage score is calculated using the number of correctly answered items divided by the total number of items; for instance answering 18 of 20 items correctly achieves a 90 percent score. The percentage shows how closely an individual approaches the criterion or standard. |
||||||||
Cut score. Cut scores, or passing scores, specify the score that is considered proficient for the assessment – a “passing grade,” so to speak. Cut scores may be set using raw scores, percentages, or percentiles. |
||||||||
|
||||||||
Norm-Referenced Assessment Results |
||||||||
Norm-referenced assessments also convert raw scores into percentiles, stanines, and grade equivalents. |
||||||||
Percentile. The percentage differs from a percentile, or percentile rank, used with norm-referenced assessments. A percentile is calculated by determining the percent of scores within a group that fall below a given raw score; for instance, a score in the 90th percentile indicates that 90 percent of other test-takers received the same or lower score. Percentile ranks are not always evenly distributed and do not necessarily indicate quality of response. A test-taker could be in the 99th percentile and have correctly answered only 50 percent of the items, if everyone else in the norm group answered 50 percent or fewer of the items correctly. |
||||||||
Stanines. Because percentiles are not necessarily evenly distributed, scores are often reported in stanines. A stanine is based on a bell curve, or normal distribution of scores, divided into nine units (1 to 9); a score of five indicates average performance. Like all norm-referenced scores, stanines compare an individual score to the scores of other test-takers in a reference group |
||||||||
Grade equivalent. Another way that scores are expressed is the grade equivalent – for example, a 7.6 grade equivalent score on a math test. However, a 7.6 grade equivalent score does not mean that the youth has math skills appropriate to the 6th month of grade 7. Rather, it means that the youth’s raw score was average for youth in the norm group who were in the 6th month of grade 7 – whether those youth had math skills appropriate to that grade level or not. |
||||||||
|
||||||||
Educational Functioning Levels |
||||||||
Under Common Measures, annual literacy and numeracy skills gains for out-of-school youth must be reported in educational functioning levels (EFLs). Each EFL is equal to approximately 2-3 grade levels. WIA youth programs must use basic skills assessments crosswalked to EFLs for Common Measures reporting. Please see TEGL 17-05 for more information regarding EFLs, especially Attachment C. |
||||||||
|
||||||||
Interpreting Assessment Results |
||||||||
Be wary of both perfect scores and scores that approach chance (scores that could be achieved by random guessing); neither of those scores provides an accurate result. A youth who receives a perfect score may be taking an assessment that is too easy; a youth whose scores hover around 50 percent may be taking an assessment that is too difficult. Reassessment with the appropriate level will provide better information about the youth’s specific needs. Some assessments, such as the TABE 9/10, offer brief Locator Tests that help determine the appropriate level to administer. |
||||||||
In some case, it is legitimate to question the accuracy of a youth’s score. If the results of an assessment are inconsistent with other measures of performance such as school grades or another assessment, or if you know that the youth was ill or unduly stressed during the assessment, interpret the results with caution, or, if possible, allow the youth to retake the assessment. |
||||||||
|
||||||||
Using Assessment Results |
||||||||
Use both formal and informal assessment results to aid in developing an ISS and to help youth make educational and career plans. Informal assessment can be incorporated into all staff interactions with youth. Basic skills assessments must be given at least annually; other objective assessments may be given when youth reach an appropriate goal or milestone. |
||||||||
Remember that an assessment, no matter how rigorously developed, administered, and interpreted, only provides limited information about a youth. Never make important decisions solely on the basis of the results of a single assessment; use other methods and results to makes important decisions about a youth’s capabilities, interests, and needs. |
||||||||
|
||||||||
How Do I Ensure That This Assessment Is Conducted Ethically? |
||||||||
Assessment results have consequences for WIA youth programs. Local area youth programs are held to performance standards, under both original WIA Performance Measures and new Common Measures. Two of those performance standards, the Younger Youth Skill Attainment Rate and Literacy/Numeracy Gains, involve specific skill gains that are identified through assessment. Since local areas can be sanctioned for poor performance or rewarded for good performance, temptations can occur. |
||||||||
In addition, it is only human to want youth to succeed, and that understandable desire can lead to “help” that defeats the purpose of the assessment. For example, teaching narrowly to the test may improve results in the short term, but important knowledge and skills for long-term success may be missed. WIA youth staff should never engage in unethical practices in assessment, such as:
|
||||||||
Remember that the ultimate purpose of assessment is to accurately demonstrate what youth know and can do. Information from assessment results can help improve instruction offered to youth and other programming. The best way to ensure good assessment results is to provide an effective program. |
||||||||
|
||||||||
Sources |
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
|
||||||||
![]() |
LearningWork Connection is an initiative of the Center for Learning Excellence at The Ohio State University. |
|||||||
![]() |
![]() |
Youthwork Information Briefs are sponsored by Ohio Department of Job and Family Services - ODJFS, Office of Workforce Development, Bureau of Workforce Services. |
||||||