EP1989561A2 - Methods and apparatus for data analysis - Google Patents
Methods and apparatus for data analysisInfo
- Publication number
- EP1989561A2 EP1989561A2 EP07757164A EP07757164A EP1989561A2 EP 1989561 A2 EP1989561 A2 EP 1989561A2 EP 07757164 A EP07757164 A EP 07757164A EP 07757164 A EP07757164 A EP 07757164A EP 1989561 A2 EP1989561 A2 EP 1989561A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- data
- outliers
- test data
- test
- outlier threshold
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01R—MEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
- G01R31/00—Arrangements for testing electric properties; Arrangements for locating electric faults; Arrangements for electrical testing characterised by what is being tested not provided for elsewhere
- G01R31/28—Testing of electronic circuits, e.g. by signal tracer
- G01R31/2851—Testing of integrated circuits [IC]
- G01R31/2894—Aspects of quality control [QC]
Definitions
- Test data may come from a variety of sources, such as parametric electrical testing, optical inspection, scanning electron microscopy, energy dispersive x-ray spectroscopy, and focused ion beam processes for defect analysis and fault isolation. Testing is typically performed before device packaging (at wafer level) as well as upon completion of assembly (final test).
- TAI 1316 Gathering and analyzing test data is expensive and time consuming. Automatic testers apply signals to the components and read the corresponding output signals. The output signals may be analyzed to determine whether the component is operating properly. Each tester generates a large volume of data. For example, each tester may perform 200 tests on a single component, and each of those tests may be repeated 10 times. Consequently, a test of a single component may yield 2000 results. Because each tester is testing 100 or more components an hour and several testers may be connected to the same server, the test process generates an enormous amount of data.
- Figure 1 is a block diagram of a test system according to various aspects of the present invention and associated functional components;
- Figure 2 is a plot of test results including outliers and failures;
- FIG. 3 is a diagram of a system for automatically selecting one or more outlier identification algorithms
- Figure 4 is a plot of test results including hidden outliers
- Figure 5 is a diagram of a wafer and a sample path followed by a test prober over the wafer
- Figure 6 is a diagram of variations of test data as the prober follows the path
- Figure 7 is a flow diagram for identifying hybrid outliers
- Figure 8 is a diagram of a wafer and a spatial analysis window
- Figure 9 is a diagram of potential filter types for proximity analysis
- Figure 10 is a representation of an asymmetrical distribution
- Figures 1 IA-B are correlation charts of test data for two different tests.
- Figure 12 is a map of test results for a wafer.
- the present invention may be described in terms of functional block components and various process steps. Such functional blocks and steps may be realized by any number of hardware or software components configured to perform the specified functions.
- the present invention may employ various testers, processors, storage systems, processes, and algorithms, such as statistical engines, memory elements, signal processing elements, neural networks, pattern analyzers, logic elements, programs, and the like, which
- TAI 1316 may carry out a variety of functions under the control of one or more testers, microprocessors, or other control devices.
- the present invention may be practiced in conjunction with any number of test environments, and each system described is merely one exemplary application for the invention. Further, the present invention may employ any number of conventional techniques for data analysis, component interfacing, data processing, component handling, and the like.
- a method and apparatus operates in conjunction with a test system 100 having a tester 102, such as automatic test equipment (ATE) for testing semiconductors.
- the test system 100 comprises a tester 102 and a computer system 108.
- the test system 100 may be configured for testing any components 106, such as semiconductor devices on a wafer, circuit boards, packaged devices, or other electrical or optical systems.
- the components 106 may be applied to many environments having multiple data points, such as credit card fraud detection, athlete performance analysis, voting irregularity analysis, and severe weather prediction.
- the components 106 comprise multiple integrated circuit dies formed on a wafer or packaged integrated circuits or devices.
- the components 106 are created using a fabrication process, which may comprise any suitable manufacturing process for creating the components 106, and may include a test process, which may comprise any suitable process for testing the operation of the components 106.
- the tester 102 suitably comprises any test equipment that tests components 106 and generates output data relating to the testing, and may comprise multiple machines or other sources of data.
- the tester 102 may comprise a conventional automatic tester, such as a Teradyne tester or the like, and suitably operates in conjunction with other equipment for
- the tester 102 may be selected and configured according to the particular components 106 to be tested and/or any other appropriate criteria.
- the tester 102 may operate in conjunction with the computer system 108 to, for example, program the tester 102, load and/or execute the test program, collect data, provide instructions to the tester 102, analyze test data, control tester parameters, and the like.
- the computer system 108 receives tester data from the tester 102 and performs various data analysis functions independently of the tester 102.
- the computer system 108 may comprise a separate computer having a processor 110 and a memory 112, such as a personal computer or workstation, connected to or networked with the tester 102 to exchange signals with the tester 102.
- the computer system 108 may be omitted from or integrated into other components of the test system 100. and various functions may be performed by other components, such as the tester 102 or elements connected to the network.
- the memory 112 suitably stores a component identifier for each component 106, such as x-y coordinates corresponding to a position of the component 106 on a wafer map for the tested wafer.
- Each x-y coordinate in the memory 112 may be associated with a particular component 106 at the corresponding x-y coordinate on the wafer map.
- Each component identifier has one or more fields, and each field corresponds, for example, to a particular test performed on the component 106 at the corresponding x-y position on the wafer, a statistic related to the corresponding component 106, or other relevant data.
- the memory 112 may be configured to include any data identified by the user as desired according to any criteria or rules.
- the computer 108 of the present embodiment also suitably has access to a storage system, such as another memory (or a portion of the memory 112), a hard drive array, an
- the storage system may be local, like a hard drive dedicated to the computer 108 or the tester 102, or may be remote, such as a hard drive array associated with a server to which the test system 100 is connected.
- the storage system may store programs and/or data used by the computer 108 or other components of the test system 100.
- the storage system comprises a database 114 available via a remote server 116 comprising, for example, a main production server for a manufacturing facility.
- the database 114 stores tester information, such as tester data files, master data files for operating the test system 100 and its components, test programs, downloadable instructions for the test system 100, and the like.
- the storage system may comprise complete tester data files, such as historical tester data files retained for analysis.
- the test system 100 may include additional equipment to facilitate testing of the components 106.
- the present test system 100 includes a device interface 104, like a conventional device interface board and/or a device handler or prober, to handle the components 106 and provide an interface between the components 106 and the tester 102.
- the test system 100 may include or be connected to other components, equipment, software, and the like to facilitate testing of the components 106 according to the particular configuration, application, environment of the test system 100, or other relevant factors.
- the test system 100 is connected to an appropriate communication medium, such as a local area network, intranet, or global netw r ork like the internet, to transmit information to other systems, such as the remote server 1 16.
- the test system 100 may include one or more testers 102 and one or more computers
- the computer 108 may be separate from the tester 102, or may be integrated into the tester 102, for example utilizing one or more processors, memories, clock circuits,
- TAI 1316 and the like of the tester 102 itself may be performed by different computers.
- a test system 100 tests the components 106 and provides enhanced analysis and test results.
- the enhanced analysis may identify incorrect, questionable, or unusual results.
- the test system 100 may also analyze multiple sets of data, such as data taken from multiple wafers and/or lots of wafers, to generate composite data based on multiple datasets. Further the test data may include data from multiple sources, such as process control or electrical test (ET) data relating to the electrical characteristics for various points on the wafer and/or for the components 106.
- ETS electrical test
- bin map data one or more wafers indicating the pass/fail binning classifications for the components 106
- outlier data and outlier signature data such as categorizations of outliers as small, medium, or critical according to selected criteria.
- Various data may also be used by the test system 100 to diagnose characteristics in the fabrication, test, and/or other process, such as problems, inefficiencies, potential hazards, instabilities, or other aspects that may be identified via the test data.
- the operator such as the product engineer, test engineer, manufacturing engineer, device engineer, or other personnel using the test data and analyses, may then use the results to verify and/or improve the test system 100 and/or the fabrication system and classify the components 106.
- the test system 100 executes an enhanced test process for testing the components 106 and collecting and analyzing test data.
- the test system 100 suitably operates in conjunction with a software application executed by the computer 108.
- the software application of the present embodiment includes multiple elements for implementing the enhanced test process, including a configuration element, a
- the test system 100 may also include a composite analysis element for analyzing data from more than one dataset. Further, the test system may include a diagnostic system for identifying characteristics and potential problems using the test data.
- Each software element suitably comprises a software module operating on the computer 108 to perform various tasks.
- the configuration element prepares test system 100 for testing and analysis.
- output test data from the tester 102 and/or other sources are analyzed to generate supplementary test data, suitably at run time and automatically, in conjunction with an in-line process, or after processing.
- the supplementary test data is then transmitted to the operator or another system, such as the composite analysis element, the diagnostic system, and/or the output element.
- the test system 100 commences a test run, for example in conjunction with a conventional series of tests, in accordance with a test program.
- the tester 102 may perform multiple tests on each component 106 on a wafer or the wafer itself, and each test may be repeated several times on the same component 106.
- the tests may comprise any appropriate tests, such as (but not limited to) continuity, supply current, leakage current, parametric static, parametric dynamic, and functional and stress tests.
- Test data from the tester 102 is stored for quick access and supplemental analysis as the test data is acquired. The data may also be stored in a long-term memory for subsequent analysis and use.
- the output test data for each component, test, and repetition is stored by the tester 102 in a tester data file.
- the output test data received from each component 106 is analyzed by the tester 102 to classify the performance of the component 106, such as into a particular bin classification, for example by comparison to the upper and lower test limits, and the results of the classification are also stored in the
- the tester data file may include additional information as well, such as logistics data and test program identification data.
- the tester data file is then provided to the computer 108 in an output file, such as a standard tester data format (STDF) file, and stored in memory.
- STDF standard tester data format
- the tester data file may also be stored in the storage system for longer term storage for later analysis, such as by the composite analysis element.
- the supplementary data analysis element analyzes the data to provide enhanced output results.
- the computer 108 may provide any appropriate analysis of the tester data to achieve any suitable objective.
- the supplementary data analysis element 206 may implement a statistical engine for analyzing the output test data and identifying data and characteristics of the data of interest at run time or later. The data and characteristics identified may be stored, while data that is not identified may be otherwise disposed of. such as stored or discarded.
- the supplementary data analysis element 206 may perform the supplementary data analysis at run time, as an in-line process, or as an off-line process.
- the present supplementary data analysis is performed as an in-line process, i.e., as an automatic function integrated into the testing process.
- the computer 108 may perform additional analysis functions upon the generated statistics and the output test data. Each test generates at least one result for at least one of the components.
- an exemplary set of test results for a single test of multiple components comprises a first set of test results having statistically similar values and a second set of test results characterized by values that stray from the first set.
- Each test result may be compared to an upper test limit and a lower test limit. If a particular result for a component exceeds either limit, the component may be classified as a "bad part" or otherwise classified according to the test and/or the test result.
- TAI 1316 Some of the test results in the second set that stray from the first set may exceed the control limits, while others do not. For the present purposes, those test results that stray from the first set but do not exceed the control limits or otherwise fail to be detected are referred to as "outliers". Outliers are generally considered to be observations which appear to be inconsistent with the remainder of a set of data. The outliers in the test results may be identified and analyzed for any appropriate purpose, such as to identify potentially unreliable components. The outliers may also be used to identify various potential problems and/or improvements in the test and manufacturing processes.
- Analyzing each relevant datum according to the selected algorithm suitably identifies the global and/or hybrid outliers. If a particular algorithm is inappropriate for a set of data, the computer 108 may select a different algorithm. The computer 108 may operate in any suitable manner to designate outliers, such as by comparison to pre-selected or dynamic values. For example, an outlier identification system according to various aspects of the present invention initially automatically calibrates its sensitivity to outliers based on selected statistical relationships for each relevant datum or other data. Some of these statistical relationships are then compared to a threshold or other reference point, such as the data mode, mean, or median, or combinations thereof, to define relative outlier threshold limits.
- a threshold or other reference point such as the data mode, mean, or median, or combinations thereof
- the statistical relationships are scaled, for example by one, two, three, and six standard deviations of the data, to define the different outlier amplitudes.
- the output test data may then be compared to the outlier threshold limits to identify and categorize the output test data as outliers.
- the computer 108 stores the resulting statistics and outliers, as well as corresponding identifiers, such as the x-y wafer map coordinates. Selected statistics, outliers, and/or failures
- TAI 1316 may also trigger notification events, such as sending an electronic message to an operator, triggering a light tower, stopping the tester 102, or notifying a server.
- the supplementary data analysis element 206 includes a scaling element 210 and an outlier classification engine 212.
- the scaling element 210 is configured to dynamically scale selected coefficients and other values, for example according to the output test data.
- the outlier classification engine 212 is configured to identify and/or categorize the various outliers in the data according to selected algorithms.
- the scaling element of the present embodiment uses various statistical relationships for dynamically scaling outlier sensitivity.
- the scaling coefficients may be calculated by the scaling element and used to modify selected outlier sensitivity values. Any appropriate criteria, such as suitable statistical relationships, may be used for scaling.
- the outlier classification engine 212 is suitably configured to identify and/or categorize the outliers in the components 106, output test data, and/or analysis results according to any suitable algorithms.
- the outlier classification engine 212 may be configured to utilize multiple candidate outlier identification algorithms and identify one or more algorithms suited for identifying outliers in the output test data. Different tests generate different population distributions, such that an outlier identification algorithm that is appropriate for one test may be inappropriate for another.
- the outlier classification engine 212 is suitably configured to differentiate between different data populations and automatically select one or more outlier identification algorithms based on the data population type of the current data. The automatic selection may select from any appropriate set of candidate outlier identification algorithms, and may perform the selection according to any suitable criteria and analysis.
- the outlier classification engine 212 may be configured to automatically perform an outlier identification algorithm selection process.
- the outlier identification algoritm selection process may be performed to select one or more appropriate algorithms from multiple algorithms for identifying global and/or hybrid outliers.
- the outlier classification engine 212 comprises a pre-processing engine 310 and a classification engine 312.
- the pre-processing engine 310 suitably generates data to facilitate selection of the relevant outlier identification algorithms.
- the classification engine 312 suitably selects one or more relevant outlier identification algorithms and identifies the global and/or hybrid outliers accordingly.
- the output test data for example data taken from a particular test, are initially provided to the outlier classification engine 212 to analyze the output test data for compatibility with various candidate outlier identification algorithms.
- the data may be analyzed in any suitable manner to identify appropriate algorithms for identifying the outliers in the output test data.
- the pre-processing engine 310 receives the output test data and prepares the available outlier identification algorithms, such as by retrieving them from an outlier identification algorithm library stored in memory.
- the pre-processing engine 3010 analyzes the output test data for outliers using several of the available algorithms.
- the pre-processing engine 310 analyzes the output test data using each of the algorithms designated by the user, or another suitable selection of algorithms, to generate pre-processing data, such as outliers as identified by all algorithms and various descriptive statistics, such as minimum, maximum, mean, median, standard deviation, CPK, CPM, and the like.
- the algorithms may be based on industry standard (e.g., IQR, median +/- N*sigma, etc.) and/or proprietary, custom, or user-defined outlier identification techniques.
- industry standard e.g., IQR, median +/- N*sigma, etc.
- proprietary, custom, or user-defined outlier identification techniques e.g., IQR, median +/- N*sigma, etc.
- TAI 1316 identification algorithm library is suitably configurable by the user, for example to add, remove, or edit outlier identification algorithms, for example according to the particular products under test or the characteristics of the tests to be performed. Different algorithms may be appropriate for different statistical population types, such as normal, logarithmic normal, bimodal, clamped, or low CPK data populations.
- the candidate outlier identification algorithms may comprise any suitable algorithms for various types and distributions of data, such as inter-quartile range (IQR) normal distribution, 3 sigma; IQR normal distribution, 6 sigma; IQR log normal. 3 sigma; IQR log normal, 6 sigma; bimodal algorithms; clamped algorithms; low capability algorithms; custom algorithms based on 3-, 6-, or n-sigma; and proprietary algorithms having various sensitivities.
- IQR inter-quartile range
- the pre-processing algorithm results are dynamically selected for global and/or hybrid outlier detection.
- the outlier classification engine 212 analyzes the test results generated by the pre-processing engine 310 to identify the most useful or applicable outlier identification algorithms. The data from the selected outlier identification algorithms may be retained, while the remaining data is discarded.
- the classification engine 312 receives the results of the pre-processing analysis generated by each of the available outlier identification algorithms.
- the classification engine 3012 analyzes the pre-processing data according to any suitable criteria, such as predetermined and/or user-defined recipe-driven rules to determine whether the pre-processing data satisfy various criteria.
- the rales may be any appropriate rules, for example employing statistical ratios or values, such as comparing statistics, like minimum, maximum, mean, median, standard deviation, CPK, and CPM, to various thresholds or other criteria.
- the classification engine 3012 may skip the outlier detection process under certain circumstances,
- TAI 1316 such as having too few test results or a too narrow or a bimodal distribution among the test results.
- the rules may be pre-selected and/or may be adjusted or added by the user to accommodate specific conditions of the products and test environment.
- the classification engine 312 may be configured to apply a particular algorithm to a certain type of test, for example when the results of the test are known to have a particular distribution. Other rules may determine whether a particular test is applicable. For example, the classification engine 312 may compare the CPK to a threshold. If the CPK is below the threshold, then the IQR normal outlier identification algorithm may be used. In the present system, results from an algorithm satisfying a rule are used for outlier identification. Other algorithm results for that test are suitably ignored.
- the outlier classification engine 212 may also categorize selected global and/or hybrid outliers and components 106 according to the test results and the information generated by the supplementary analysis element 206.
- the outlier classification engine 212 may be configured to categorize the components 106 into critical/marginal/good part categories, for example in conjunction with user-defined criteria; user-defined good/bad spatial patterns recognition; classification of pertinent data for tester data compression; test setup in-situ sensitivity qualifications and analysis; tester yield leveling analysis; dynamic wafer map and/or test strip mapping for part dispositions and dynamic retest; or test program optimization analyses.
- the outlier classification engine 212 may classify components 106 and associated data in accordance with conventional SPC control rules, such as Western Electric rules or Nelson rules, to characterize the data.
- the outlier classification engine 112 suitably categorizes the data using a selected set of classification limit calculation methods. Any appropriate categorization methods may be used to characterize the data according to the needs of the operator.
- TAI 1316 classification engine 212 categorizes outliers by comparing the output test data to selected thresholds, such as values corresponding to one, two, three, and six statistically scaled standard deviations from a threshold, such as the a test limit or a data mean, mode, and/or median.
- selected thresholds such as values corresponding to one, two, three, and six statistically scaled standard deviations from a threshold, such as the a test limit or a data mean, mode, and/or median.
- the outlier thresholds may be defined asymmetrically with respect to a selected center point in the data distribution. Using asymmetrical thresholds may reduce the effects of non-Gaussian distributions and/or the presence of outliers on identifying the outliers.
- the outlier thresholds may be selected without determining a particular location in the data, such as a mean, or a scale, such as a standard deviation, based on the data, which can be influenced by outliers.
- the outlier thresholds may be selected according to the distribution of the data, such as by deriving the thresholds from a center point of the frequency distribution of the test data.
- the outlier thresholds 1010, 1012 may be adjusted or selected according to the quartile values 1014, 1016 for the set of data.
- the quartile values correspond to the median values between the median of the full data set and the edge of the distribution.
- the thresholds 1010, 1012 may be adjusted according to any relevant criteria, such as the distribution of the data.
- the thresholds 1010, 1012 are adjusted according to the generalized slope of the data distribution in the relevant area, such as around the first and third quartile points 1014, 1016.
- the thresholds 1010, 1012 may be adjuusted by an amount that is inversely related to the general slope of the data in the relevant area. Consequently, in an area where the slope is lower, the outlier thresholds 1010, 1012 may be farther from the center point than in an area where the slope is
- the lower threshold and upper threshold 1010, 1012 may be selected according to the equations:
- UCL and LCL are the upper and lower outlier thresholds 1010, 1012 respectively
- Ql and Q3 are respectively the first and third quartile marks
- N is a constant
- UQW and LQW are weights assigned according to an inverse relationship to the slope.
- Additional thresholds may be defined as well, for example to define small, medium, and large outliers.
- the thresholds and categories may be defined in any manner.
- the category thresholds may be determined according to scaled statistical relationships, for example by one, two, three, and six standard deviations of the data from the median, to define the different outlier amplitudes.
- the output test data may then be compared to the outlier threshold limits to identify and categorize the output test data as outliers.
- the categories may be defined relative to the outlier thresholds.
- the demarcations between the various categories may be selected according to any suitable criteria, such as according to the outlier threshold, the test limit, and/or the edge of the distribution.
- the range between the outlier threshold and the test limit may be divided into two or more categories 1020, 1022, 1024.
- the range between the outlier threshold and the test limit may be divided into equal areas or different areas to define different magnitudes of outliers.
- each side of the distribution between the outlier threshold and the edge of the distribution is divided into three or four equal categories 1020, 1022, 1024 to define the outliers as large, medium, small, and/or tiny.
- the size and number of the various categories may be identical on both sides of the distribution, or may vary for each side of the distribution.
- the outlier classification engine 112 analyzes and correlates the normalized outliers and/or the raw data points based on user-defined rales.
- the outlier classification engine 1 12 suitably performs the categorization according to each test, which may be performed independently of data from other tests or in conjunction with such data from other tests. Any suitable criteria may be used for categorizing the components based on test failures and outliers, such as:
- FAIL if the part fails at least one test.
- CRITICAL if at least one test has been identified to have a LARGE outlier or at least two MEDIUM outliers on two different tests for the otherwise passing part.
- MARGINAL if at least one test has been identified to have a MEDIUM outlier or at least four SMALL outliers on four different tests for the otherwise passing part.
- SMALL if at least one test has been identified to have a SMALL outlier for the otherwise passing part.
- PASS without any SMALL, MEDIUM, or LARGE outlier for the otherwise passing part.
- Criteria for small, medium, and large outliers may be selected according to any suitable criteria, such as thresholds based on the test limits and/or characteristics of the data.
- the outlier classification engine 1 12 may also include a hybrid outlier system configured to identify hybrid outliers, also referred to as local outliers or hidden outliers, in the data. Hybrid outliers are inconsistent data points within a local group of data, but do not cross the thresholds defining the "global" outlier boundaries of the main population.
- the outlier classification system may analyze the test data to identify outliers according to two or more parameters.
- Other parameters may be any appropriate data, such as data for spatially related components, data resulting from correlated tests, or other relevant data.
- spatial information may be used to identify devices whose parameters are all within the main distribution, but not consistent with data for a more local physical area.
- a high level of correlation between two parameters may be used to identify devices whose parameters are individually consistent with the main distribution, but are not consistent in their relationship to another parameter.
- the degree of correlation between two parameters may be assessed in any manner.
- a correlation index may be calculated using linear or logarithmic regression techniques, such as to establish a best-fit straight line between two data sets. The closer the magnitude of the slope of this line is to unity, the higher the degree of correlation. Determining the degree of correlation include any relevant considerations or processes, such as removing large outliers or compensating for multi-site variation.
- the correlation of test results may be determined to identify outliers that do not conform to the underlying relationship.
- FIG 1 IA for tests with a high degree of correlation, all results are expected to reside within a certain distance 1 1 12 from the best-fit straight line 11 10.
- Figure 1 IB if an outlying result 1114 is present, then the overall correlation index slightly affected, but the corresponding data points resides outside the expected distance 1112 from the best-fit straight line 1 110, even if the test result remains within the main distribution for the relevant parameter.
- the computer 108 may also select situations for performing correlation analyses,
- TAI 1316 for example according to the degree of correlation between two parameters, availability of computing resources, and demand for optimal analysis.
- hybrid outliers may be identified in view of the test data for the component relative to data for spatially related components, such as components from a smaller local population of data points, like a temporally or spatially local population.
- the tester generates data by testing various components on the wafer sequentially. The tester moves the prober from component to component, accumulating test data for each component. Due to variations in the manufacturing process and materials and the testing process, the test data may vary regularly as the prober traverses the wafer ( Figure 6). Data points 610 lying well outside the variations for the wafer or for multiple wafers are ordinarily classified as global outliers. Data points 604 within or slightly beyond the variations ordinarily escape classification as global outliers, even if the data points 604 significantly differ from the data points for spatially nearby or corresponding components.
- Hybrid outliers are suitably identified by analyzing individual raw, normalized, or otherwise processed data points with respect to proximate data points.
- the outlier classification engine 1 12 may apply a proximity analysis by comparing parametric or other data for individual components to hybrid outlier thresholds calculated using data for spatially related components.
- the proximity analysis may also be weighted, for example according to the distance between a central component and a nearby component.
- Proximity analysis may be performed by any appropriate system, such as a dedicated proximity analysis engine or a proximity analysis engine associated with another task, like a proximity analysis engine used for generating composite representations based on multiple datasets.
- the outlier classification engine As parametric data is processed, the outlier classification engine
- the 112 may calculate hybrid outlier thresholds for a local data set, such as a selected number of most recently generated data.
- the data to be analyzed may comprise any suitable data, such as preceding data points, subsequent data points, both preceding and subsequent data points, temporally related data points, or spatially related data points.
- the number of data points in the local data set may be selected according to any suitable criteria, such as using a pre-selected number of data points or selecting the number of data points according to various criteria, such as the variability or noisiness of the data. If the data is noisy, the number of data points may be automatically adjusted, for example by increasing the number of data points in the local data set to reduce the effects of the noise.
- the hybrid outlier thresholds may be dynamically re-calculated with accumulation of new parametric data, for example by using a first-in, first-out (FIFO) calculation.
- the parametric data for the components in the area may then be compared to the thresholds to identify hybrid outliers.
- the outlier classification engine 1 12 may be configured to identify the hybrid outliers
- the hybrid outliers 604 are identified in conjunction with an in-line process, i.e., performed automatically via a process that receives the data as it is generated or after testing is complete and the data stored and automatically provides the results without operator intervention or other action.
- the hybrid outliers 604 may be identified at run time or after the analysis is completed and the data stored.
- the outlier classification engine 1 12 may be integrated into the test program executed by the tester 102.
- the test data may comprise any type of test data, such as bin results or parametric testing data. Further, the data may be pre- processed, such as by removing global outliers and/or data for components classified as
- TAI 1316 failures or normalizing data across various sections of the wafer, such as sections associated with multisite testing or stepper fields.
- the outlier classification engine 112 or other outlier identification system initially filters out global outliers from the data and processes the remaining data to identify hybrid outliers (710).
- the outlier classification engine 112 may further process the filtered data, for example to normalize the data in view of known sources of variation (712).
- the outlier classification engine 112 may then identify hybrid outliers according to any suitable technique or criteria.
- the outlier classification engine 1 12 may select and/or adjust a window' encompassing a selected number and geometry of components to identify spatially related components (714).
- the outlier classification engine 112 may then determine a neighborhood value for the component corresponding to the test data for the components in the neighborhood (716).
- the test data for the component may then be analyzed relative to the neighborhood value to identify hybrid outliers.
- the data may be normalized, which comprises adjusting data values to correct for differences in data that may be generated due to known causes that negatively impact the consistency of the data.
- the hybrid outlier system normalizes the test data for each component relative to test data for all other components on the wafer to facilitate comparison. For example, if the test data was generated using multi- site testing in which multiple resources test wafers or components
- the data may be normalized in any suitable manner, such as to address known sources of variation. Components sharing a known common source of inconsistency may be normalized using the same normalization criteria. For example, in the present embodiment,
- TAI 1316 parametric test data generated in conjunction with a multi-site testing environment may be normalized according to the following equation: data site,, - median site,.
- norm_site n is the resulting normalized data
- data_site n is the prenormalized data
- median sit ⁇ n is the statistic median
- iqr_site n is the interquartile range for the site analyzed.
- the normalized data may be stored in any suitable manner, for example on a normalized data device map having the normalized data for each device.
- the outlier classification engine 112 analyzes local data, such as data for spatially near components. For example, the outlier classification engine 112 may detect hybrid outliers according to the geometry of the wafer. The hybrid outliers may be identified according to any suitable technique or process. In the present embodiment, the outlier classification engine 1 12 analyzes the data for nearby components.
- the computer 108 may perform a proximity analysis for each component on the wafer.
- the computer 108 may be configured to identify hybrid outliers by identifying outliers within subsets of the overall dataset, such as within data for a selected group of components on a particular wafer.
- the computer 108 may establish a pattern or window 810 of a selected size that may be used to select multiple components on a wafer 812.
- the pattern or window 810 comprises a perimeter defining a spatial area of interest, such as a subset of the components on the wafer 812.
- the size of the window 810 may be selected or adjusted according to any suitable criteria, such as the number of components proximate the central component, the type of test or component, or the desired sensitivity of the outlier analysis.
- the shape of the window may likewise be selected according to any appropriate criteria, such as the spatial
- TAI 1316 area of interest.
- the area of interest does not facilitate the use of a rectangular window, such as near the curved edges of the wafer 812
- an alternative shape may be selected for the window 810 to accommodate the relevant area of the wafer 812.
- the outlier classification engine 1 12 may employ one or more predetermined windows or masks that may be applied to each component in the data set to establish the relevant window.
- the size and shape of the windows may be selected according to any suitable criteria, such as according to a preselected type, an operator designation, or an automatic selection process.
- the user can predefine the type and size of the window to be used or the outlier classification engine 112 can automatically adapt the window shape and/or size according to the nature of the area surrounding the relevant component, such as for locations near the edge of the wafer, possible sampling factors, empty spaces on the wafer or missing data, and the like.
- the outlier classification engine 112 requires a selected amount of data, such as data for at least four to eight surrounding components, such as five, to perform the analysis.
- the devices only count towards the minimum if they satisfy one or more selected criteria, such as being classified as passing or "good" devices. If the initial area within the window has fewer than five other "good" devices, the outlier classification engine 112 may adjust increase the size and/or shape of the window or pattern until the number of components meets the required number.
- the outlier classification engine 112 adjusts the window or pattern by applying a predetermined sequence of patterns, in which the sequence comprises patterns of increasing size.
- the outlier classification engine 112 may abort the analysis if a minimum number of acceptable data is not available. For example, the outlier classification engine 112 may only perform the analysis and/or adjust the size and/or shape of the window if the device under
- TAI 1316 study has at least five "bin 1" devices in the initial window.
- data for a particular device may be categorized as a hybrid outlier only if there are sufficient data for neighboring components in the initial window.
- substitute data may be used.
- the missing data may be replaced with idealized data representing ideal values for such components, or with data based on other components on the wafer, such as components in surrounding bands around the missing data or components in corresponding positions on other wafers in the lot.
- the window 810 is then applied to various areas of the wafer 812, and hybrid outliers are identified among the data for components within the window 810.
- the outlier classification engine 112 may identify the hybrid outliers according to any appropriate criteria and technique. For example, the outlier classification engine 112 may initially analyze a starting component, such as a component at the left end of the top row r of the wafer, and sequentially analyze data for each component on the wafer. The outlier classification engine 112 may adjust the size and shape of the window as each component is analyzed. Data for the components within the window are then retrieved from memory and analyzed for outliers, for example using only the data for components within the window.
- the area within the window may be characterized and/or analyzed according to any suitable criteria or technique.
- the outlier classification engine 112 may calculate one or more representative neighborhood values for each component based on the data for the surrounding components. A window is applied to each component to generate a neighborhood value for each component for the relevant test. The data for each component may then be analyzed and/or compared to the corresponding neighborhood value to identify hybrid outliers.
- the outlier classification engine 1 12 calculates a neighborhood value for the window corresponding to a derived value derived from values for the components in the neighborhood.
- the derived value may comprise an average value representing a central value for multiple values, such as an arithmetic mean, statistic median, mode, geometric mean, weighted mean, and the like.
- the present outlier classification engine 112 employs the median value as the neighborhood value, as the median may often operate as a robust estimator of the center of the local neighborhood.
- the median value is then stored, for example on a neighborhood value device map storing the neighborhood value calculated for each device.
- the outlier classification engine 112 moves the window to the next component.
- the outlier classification engine 1 12 may apply different weights to the data of surrounding components, such as according to the proximity of a device to the central device. For example, data for components closer to the central component may have greater affect in calculating the neighborhood value than data for components that are farther away.
- Hybrid outliers may be identified by analyzing the data for each component in view of the corresponding neighborhood values. Any suitable outlier detection system or approach may be used to identify the hybrid outliers. For example, the outlier classification engine 112 may calculate a unique hybrid outlier threshold based on the neighborhood value for each component and compare the raw data to the calculated threshold. Alternatively, the outlier classification engine 112 may generate a value based on the raw data and the neighborhood value for each component and compare the value to a threshold or the like. Locations of identified hybrid outliers are stored, along with any other relevant information.
- the outlier classification engine 1 12 generates a residual value for each component according to the raw data and the
- the residual component may be determined according to any suitable criteria or technique.
- residuals are calculated using the following equation:
- RMSE defines the square root of the mean square error of values for all relevant points, such as for all devices on the wafer.
- RMSE is the root mean square of the sum of the differences between the normalized and smoothed data for each device on the wafer. This equation produces a standardized error, which is approximately a normal distribution with mean 0 and variance of 1.
- the resulting residual data may be stored, for example in a residual data device map storing residual data values for each device.
- Outlier detection techniques may be applied to the residual values for the various components to identify outliers.
- the outlier classification engine 112 may calculate control limits based on the residual values, and the residual values exceeding the limits may be identified as hybrid outliers.
- the outlier classification engine 112 may analyze the residual data according to one or more outlier identification algorithms to identify outliers in the residual data. The corresponding devices may then be designated as outliers for the relevant test.
- the outlier classification engine 1 12 may also be configured to perform additional analysis of the hybrid outlier data.
- the outlier classification engine 112 may identify and categorize selected hybrid outliers and components 106, such as to categorize the components 106 into multiple categories as described previously regarding global outliers.
- the hybrid outliers are categorized as small, medium, or large hybrid outliers, for example in conjunction with user-defined criteria; user-defined spatial patterns recognition; categorization of pertinent data for tester data compression; test setup in-situ sensitivity qualifications and analysis; tester yield leveling analysis; dynamic wafer map and/or test strip mapping for part dispositions and dynamic retest; or test program optimization analyses.
- the outlier classification engine 112 suitably classifies the hybrid outlier data using a selected set of classification limit calculation methods. Any appropriate classification methods may be used to characterize the hybrid outlier data according to the needs of the operator.
- the present outlier classification engine 112 categorizes hybrid outliers by comparing the relevant test data or derived data to selected thresholds, such as values corresponding to one, two. three, and six statistically scaled standard deviations from a threshold, such as the test limits or a data mean, mode, and/or median for a relevant geographic area.
- the categorization of hybrid outliers in this manner tends to normalize any identified hybrid outliers for any test regardless of datum amplitude and relative noise.
- the outlier classification engine 112 may categorize hybrid outliers according to the magnitude of the difference between the test data for the central component and the derived value for the plurality of local components.
- the categorization may be based directly on the difference or indirectly, such as according to a value based on or derived from the difference.
- the outlier classification engine 1 12 analyzes and correlates the normalized hybrid outliers and/or the raw data points based on user-defined rules.
- the hybrid outlier classification engine 112 suitably performs the categorization according to each test, which may be performed independently of data from other tests or in conjunction with such data
- Criteria for small, medium, and large outliers may be selected according to any suitable criteria, such as thresholds based on the test limits and/or characteristics of the data.
- the outlier classification engine 112 may forego the hybrid outlier detection analysis or discard the results of such analysis if the results appear to be unlikely to yield useful information. For example, the outlier classification engine 112 may compare the number of test data points for a particular test to a threshold, such as 20. If there are fewer than the threshold number of test data points for analysis, the outlier classification engine 112 may forego the hybrid outlier analysis. Similarly, if the number of hybrid outliers detected exceeds a threshold, such as 10, the hybrid outlier analysis results may be discarded as generating too many results to be considered hybrid outliers. Alternatively, the outlier classification engine 1 12 may perform and/or retain the results, but annotate the results to indicate the potentially suspect nature of the data.
- a threshold such as 20
- the outlier classification engine 1 12 may also identify components that are near clusters of failures and outliers. Such components in the same area might exhibit problems similar to the outliers and failures in the area, but the problems may not be identified upon initial analysis of the test data. Thus, the outlier classification engine 112 may identify a "good die in a bad neighborhood" using failure data, global (or single-parameter) outlier data, and/or hybrid (or multiple-parameter) outlier data.
- the outlier classification engine 112 may identify relevant neighborhoods and nearby components in any appropriate manner. For example, referring to Figure 12, the outlier classification engine 112 may initially identify the relevant neighborhoods by analyzing the failure data 1210, global outlier data 1212, and/or hybrid outlier data 1212 to identify groups 1214 of adjacent or otherwise spatially related components. The neighborhoods around the
- TAI 1316 groups 1214 may be defined according to any criteria, such as size, shape, or location of the group 1214.
- the neighborhoods may be defined according to one or more types of data, such as groups 1214 of failures 1210, global outliers 1212, or hybrid outliers 1212.
- the outlier classification engine 1 12 identifies neighborhoods comprising a minimum number of test results for adjacent components 106 having failures 1210, global outliers 1212, and hybrid outliers 1212.
- the outlier classification engine 112 identifies spatially related test results that do not comprise failures 1210, global outliers 1212, and hybrid outliers 1212. For example, test results for components 106 adjacent at least one, two, or three components 106 in the group 1214. or within a selected distance from the group 1214, may be designated for further analysis.
- the identified components 106 may be designated, for example as "suspect" components 1216 or the like.
- the identified suspect components 1216 and test results may be further analyzed, designated as outliers, and/or reported.
- the computer 108 collects data from the test system 100, suitably at run time or in conjunction with an in-line process, and provides an output report to a printer, database, operator interface, or other desired destination. Any form, such as graphical, numerical, textual printed, or electronic form, may be used to present the output report for use or subsequent analysis.
- the output element 208 may provide any selected content, including selected output test data from the tester 102 and results of the supplementary data analysis.
- the output element 208 suitably provides a selection of data from the output test data specified by the operator as well as supplemental data.
- the computer 108 may also be configured to include information relating to the outliers, or other information generated or identified by the supplementary data analysis element. If so
- the identifiers such as x-y coordinates, for each of the outliers are assembled as well.
- the coordinates for the operator-selected components and the outliers are merged into an output report, which in the current embodiment is in the format of the native tester data output format. Merging resulting data into the dynamic datalog facilitates compression of the original data into summary statistics and critical raw data values into a smaller native tester data file, reducing data storage requirements without compromising data integrity for subsequent customer analysis.
- the retrieved information is then suitably stored.
- the report may be prepared in any appropriate format or manner.
- the output report suitably includes the dynamic datalog having a wafer map indicating the selected components on the wafer and their categorization.
- the output element 208 may superimpose wafer map data corresponding to outliers on the wafer map of the preselected components.
- the output element may include only the outliers from the wafer map or batch as the sampled output.
- the output report may also include a series of graphical representations of the data to highlight the occurrence of outliers and correlations in the data.
- the output report may further include recommendations and supporting data for the recommendations. For example, if two tests appear to generate identical sets of failures and/or outliers, the output report may include a suggestion that the tests are redundant and recommend that one of the tests be omitted from the test program.
- the recommendation may include a graphical representation of the data showing the identical results of the tests.
- the output report may be provided in any suitable manner, for example output to a local workstation, sent to a server, activation of an alarm, or any other appropriate manner (step 712).
- the output report may be provided off-line such that the output does not affect the operation of the system or transfer to the main server.
- the computer 108 copies data files, performs the analysis, and generates results, for example for demonstration or verification purposes.
- the particular implementations shown and described are merely illustrative of the invention and its best mode and are not intended to otherwise limit the scope of the present invention in any way.
- conventional signal processing, data transmission, and other functional aspects of the systems (and components of the individual operating components of the systems) may not be described in detail.
- the connecting lines shown in the various figures are intended to represent exemplary functional relationships and/or physical couplings between the various elements. Many alternative or additional functional relationships or physical connections may be present in a practical system.
- the present invention has been described above with reference to a preferred embodiment. Changes and modifications may be made, however, without departing from the scope of the present invention. These and other changes or modifications are intended to be included within the scope of the present invention, as expressed in the following claims.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Microelectronics & Electronic Packaging (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Testing Or Measuring Of Semiconductors Or The Like (AREA)
- Tests Of Electronic Circuits (AREA)
- Complex Calculations (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US77468206P | 2006-02-17 | 2006-02-17 | |
PCT/US2007/062366 WO2007098426A2 (en) | 2006-02-17 | 2007-02-17 | Methods and apparatus for data analysis |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1989561A2 true EP1989561A2 (en) | 2008-11-12 |
Family
ID=38438077
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07757164A Withdrawn EP1989561A2 (en) | 2006-02-17 | 2007-02-17 | Methods and apparatus for data analysis |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1989561A2 (ja) |
JP (3) | JP2009527903A (ja) |
IL (1) | IL193447A (ja) |
WO (1) | WO2007098426A2 (ja) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5537346B2 (ja) * | 2010-09-06 | 2014-07-02 | 株式会社日立メディコ | 画像診断支援装置、画像診断支援方法 |
JP5767963B2 (ja) * | 2011-12-28 | 2015-08-26 | 株式会社キーエンス | 外観検査装置、外観検査方法及びコンピュータプログラム |
JP6532373B2 (ja) * | 2015-10-16 | 2019-06-19 | 株式会社Nttファシリティーズ | 蓄電池劣化推定システム、蓄電池劣化推定方法および蓄電池劣化推定プログラム |
JP6752661B2 (ja) * | 2016-09-05 | 2020-09-09 | 日置電機株式会社 | 処理装置、検査システムおよび処理プログラム |
EP3451219A1 (en) * | 2017-08-31 | 2019-03-06 | KBC Groep NV | Improved anomaly detection |
US20200004619A1 (en) * | 2018-06-28 | 2020-01-02 | Honeywell International Inc. | System and method for detecting a shift in real data trend using the configurable adaptive threshold |
US11157346B2 (en) * | 2018-09-26 | 2021-10-26 | Palo Alto Rsearch Center Incorporated | System and method for binned inter-quartile range analysis in anomaly detection of a data series |
CN113486003B (zh) * | 2021-06-02 | 2024-03-19 | 广州数说故事信息科技有限公司 | 数据可视化时考虑异常值的企业数据集处理方法及系统 |
US11907088B2 (en) * | 2021-12-15 | 2024-02-20 | Synopsys, Inc. | Testing of hardware queue systems using on device test generation |
CN118131054B (zh) * | 2024-01-12 | 2024-09-03 | 东莞市丰晖电子有限公司 | 一种钠离子电池荷电状态智能监测方法及系统 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0352247A (ja) * | 1989-07-20 | 1991-03-06 | Seiko Instr Inc | 半導体試験装置 |
JPH04283046A (ja) * | 1991-03-12 | 1992-10-08 | Nec Corp | 作業順序制御方式 |
JP2002016118A (ja) * | 2000-06-29 | 2002-01-18 | Agilent Technologies Japan Ltd | 半導体パラメトリック試験装置 |
US7356430B2 (en) * | 2001-05-24 | 2008-04-08 | Test Advantage, Inc. | Methods and apparatus for data analysis |
US8417477B2 (en) * | 2001-05-24 | 2013-04-09 | Test Acuity Solutions, Inc. | Methods and apparatus for local outlier detection |
EP1479025B1 (en) * | 2001-05-24 | 2010-09-29 | Test Advantage, Inc. | Methods and apparatus for semiconductor testing |
JP2006146459A (ja) * | 2004-11-18 | 2006-06-08 | Renesas Technology Corp | 半導体デバイスの製造方法および製造システム |
JP5116307B2 (ja) * | 2007-01-04 | 2013-01-09 | ルネサスエレクトロニクス株式会社 | 集積回路装置異常検出装置、方法およびプログラム |
-
2007
- 2007-02-17 EP EP07757164A patent/EP1989561A2/en not_active Withdrawn
- 2007-02-17 WO PCT/US2007/062366 patent/WO2007098426A2/en active Application Filing
- 2007-02-17 JP JP2008555533A patent/JP2009527903A/ja active Pending
-
2008
- 2008-08-14 IL IL193447A patent/IL193447A/en active IP Right Grant
-
2009
- 2009-04-03 JP JP2009091599A patent/JP5080526B2/ja active Active
-
2010
- 2010-02-05 JP JP2010024869A patent/JP5907649B2/ja not_active Expired - Fee Related
Non-Patent Citations (1)
Title |
---|
See references of WO2007098426A3 * |
Also Published As
Publication number | Publication date |
---|---|
JP2009527903A (ja) | 2009-07-30 |
WO2007098426A3 (en) | 2008-11-13 |
IL193447A (en) | 2015-05-31 |
IL193447A0 (en) | 2009-05-04 |
JP2010197385A (ja) | 2010-09-09 |
JP2009188418A (ja) | 2009-08-20 |
WO2007098426A2 (en) | 2007-08-30 |
JP5907649B2 (ja) | 2016-04-26 |
JP5080526B2 (ja) | 2012-11-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8788237B2 (en) | Methods and apparatus for hybrid outlier detection | |
US8606536B2 (en) | Methods and apparatus for hybrid outlier detection | |
US7904279B2 (en) | Methods and apparatus for data analysis | |
US8000928B2 (en) | Methods and apparatus for data analysis | |
US7167811B2 (en) | Methods and apparatus for data analysis | |
US6792373B2 (en) | Methods and apparatus for semiconductor testing | |
EP1989561A2 (en) | Methods and apparatus for data analysis | |
US7225107B2 (en) | Methods and apparatus for data analysis | |
US20180293500A1 (en) | Methods and apparatus for data analysis | |
US8041541B2 (en) | Methods and apparatus for data analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20080915 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA HR MK RS |
|
R17D | Deferred search report published (corrected) |
Effective date: 20081113 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G06F 19/00 20060101AFI20081120BHEP |
|
RIN1 | Information on inventor provided before grant (corrected) |
Inventor name: SCOTT, MICHAEL, J. Inventor name: MIGUELANEZ, EMILIO Inventor name: LABONTE, GREG |
|
DAX | Request for extension of the european patent (deleted) | ||
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20120901 |