id sid tid token lemma pos qz20sq89x8q 1 1 the the DET qz20sq89x8q 1 2 problem problem NOUN qz20sq89x8q 1 3 of of ADP qz20sq89x8q 1 4 finding find VERB qz20sq89x8q 1 5 structure structure NOUN qz20sq89x8q 1 6 in in ADP qz20sq89x8q 1 7 big big ADJ qz20sq89x8q 1 8 data data NOUN qz20sq89x8q 1 9 sets set NOUN qz20sq89x8q 1 10 is be AUX qz20sq89x8q 1 11 becoming become VERB qz20sq89x8q 1 12 increasingly increasingly ADV qz20sq89x8q 1 13 relevant relevant ADJ qz20sq89x8q 1 14 to to ADP qz20sq89x8q 1 15 psychologists psychologist NOUN qz20sq89x8q 1 16 as as SCONJ qz20sq89x8q 1 17 it it PRON qz20sq89x8q 1 18 becomes become VERB qz20sq89x8q 1 19 easier easy ADJ qz20sq89x8q 1 20 and and CCONJ qz20sq89x8q 1 21 cheaper cheap ADJ qz20sq89x8q 1 22 to to PART qz20sq89x8q 1 23 collect collect VERB qz20sq89x8q 1 24 data datum NOUN qz20sq89x8q 1 25 on on ADP qz20sq89x8q 1 26 human human ADJ qz20sq89x8q 1 27 behavior behavior NOUN qz20sq89x8q 1 28 . . PUNCT qz20sq89x8q 2 1 this this DET qz20sq89x8q 2 2 dissertation dissertation NOUN qz20sq89x8q 2 3 focuses focus VERB qz20sq89x8q 2 4 on on ADP qz20sq89x8q 2 5 the the DET qz20sq89x8q 2 6 problem problem NOUN qz20sq89x8q 2 7 of of ADP qz20sq89x8q 2 8 identifying identify VERB qz20sq89x8q 2 9 important important ADJ qz20sq89x8q 2 10 structural structural ADJ qz20sq89x8q 2 11 features feature NOUN qz20sq89x8q 2 12 like like ADP qz20sq89x8q 2 13 main main ADJ qz20sq89x8q 2 14 effects effect NOUN qz20sq89x8q 2 15 , , PUNCT qz20sq89x8q 2 16 nonlinear nonlinear ADJ qz20sq89x8q 2 17 effects effect NOUN qz20sq89x8q 2 18 , , PUNCT qz20sq89x8q 2 19 and and CCONJ qz20sq89x8q 2 20 interactions interaction NOUN qz20sq89x8q 2 21 in in ADP qz20sq89x8q 2 22 big big ADJ qz20sq89x8q 2 23 data datum NOUN qz20sq89x8q 2 24 sets set NOUN qz20sq89x8q 2 25 when when SCONJ qz20sq89x8q 2 26 the the DET qz20sq89x8q 2 27 number number NOUN qz20sq89x8q 2 28 of of ADP qz20sq89x8q 2 29 predictors predictor NOUN qz20sq89x8q 2 30 is be AUX qz20sq89x8q 2 31 large large ADJ qz20sq89x8q 2 32 . . PUNCT qz20sq89x8q 3 1 in in ADP qz20sq89x8q 3 2 general general ADJ qz20sq89x8q 3 3 , , PUNCT qz20sq89x8q 3 4 this this DET qz20sq89x8q 3 5 goal goal NOUN qz20sq89x8q 3 6 can can AUX qz20sq89x8q 3 7 be be AUX qz20sq89x8q 3 8 referred refer VERB qz20sq89x8q 3 9 to to ADP qz20sq89x8q 3 10 as as ADP qz20sq89x8q 3 11 exploratory exploratory ADJ qz20sq89x8q 3 12 regression regression NOUN qz20sq89x8q 3 13 analysis analysis NOUN qz20sq89x8q 3 14 . . PUNCT qz20sq89x8q 4 1 exploratory exploratory ADJ qz20sq89x8q 4 2 regression regression NOUN qz20sq89x8q 4 3 analysis analysis NOUN qz20sq89x8q 4 4 is be AUX qz20sq89x8q 4 5 beneficial beneficial ADJ qz20sq89x8q 4 6 because because SCONJ qz20sq89x8q 4 7 the the DET qz20sq89x8q 4 8 results result NOUN qz20sq89x8q 4 9 suggest suggest VERB qz20sq89x8q 4 10 testable testable ADJ qz20sq89x8q 4 11 hypotheses hypothesis NOUN qz20sq89x8q 4 12 , , PUNCT qz20sq89x8q 4 13 can can AUX qz20sq89x8q 4 14 limit limit VERB qz20sq89x8q 4 15 the the DET qz20sq89x8q 4 16 number number NOUN qz20sq89x8q 4 17 of of ADP qz20sq89x8q 4 18 plausible plausible ADJ qz20sq89x8q 4 19 models model NOUN qz20sq89x8q 4 20 , , PUNCT qz20sq89x8q 4 21 and and CCONJ qz20sq89x8q 4 22 help help VERB qz20sq89x8q 4 23 avoid avoid VERB qz20sq89x8q 4 24 errors error NOUN qz20sq89x8q 4 25 in in ADP qz20sq89x8q 4 26 model model NOUN qz20sq89x8q 4 27 specification specification NOUN qz20sq89x8q 4 28 . . PUNCT qz20sq89x8q 5 1 exploratory exploratory ADJ qz20sq89x8q 5 2 regression regression NOUN qz20sq89x8q 5 3 analysis analysis NOUN qz20sq89x8q 5 4 is be AUX qz20sq89x8q 5 5 usually usually ADV qz20sq89x8q 5 6 carried carry VERB qz20sq89x8q 5 7 out out ADP qz20sq89x8q 5 8 using use VERB qz20sq89x8q 5 9 basic basic ADJ qz20sq89x8q 5 10 data datum NOUN qz20sq89x8q 5 11 visualization visualization NOUN qz20sq89x8q 5 12 techniques technique NOUN qz20sq89x8q 5 13 , , PUNCT qz20sq89x8q 5 14 simple simple ADJ qz20sq89x8q 5 15 statistical statistical ADJ qz20sq89x8q 5 16 models model NOUN qz20sq89x8q 5 17 , , PUNCT qz20sq89x8q 5 18 or or CCONJ qz20sq89x8q 5 19 by by ADP qz20sq89x8q 5 20 fitting fit VERB qz20sq89x8q 5 21 a a DET qz20sq89x8q 5 22 number number NOUN qz20sq89x8q 5 23 of of ADP qz20sq89x8q 5 24 parametric parametric NOUN qz20sq89x8q 5 25 models model NOUN qz20sq89x8q 5 26 and and CCONJ qz20sq89x8q 5 27 selecting select VERB qz20sq89x8q 5 28 the the DET qz20sq89x8q 5 29 best good ADJ qz20sq89x8q 5 30 from from ADP qz20sq89x8q 5 31 among among ADP qz20sq89x8q 5 32 them they PRON qz20sq89x8q 5 33 . . PUNCT qz20sq89x8q 6 1 however however ADV qz20sq89x8q 6 2 , , PUNCT qz20sq89x8q 6 3 these these DET qz20sq89x8q 6 4 procedures procedure NOUN qz20sq89x8q 6 5 can can AUX qz20sq89x8q 6 6 require require VERB qz20sq89x8q 6 7 strong strong ADJ qz20sq89x8q 6 8 assumptions assumption NOUN qz20sq89x8q 6 9 and and CCONJ qz20sq89x8q 6 10 may may AUX qz20sq89x8q 6 11 not not PART qz20sq89x8q 6 12 be be AUX qz20sq89x8q 6 13 feasible feasible ADJ qz20sq89x8q 6 14 when when SCONJ qz20sq89x8q 6 15 the the DET qz20sq89x8q 6 16 number number NOUN qz20sq89x8q 6 17 of of ADP qz20sq89x8q 6 18 predictors predictor NOUN qz20sq89x8q 6 19 is be AUX qz20sq89x8q 6 20 large.gradient large.gradient ADJ qz20sq89x8q 6 21 tree tree NOUN qz20sq89x8q 6 22 boosting boost VERB qz20sq89x8q 6 23 ( ( PUNCT qz20sq89x8q 6 24 friedman_greedy_2001 friedman_greedy_2001 PROPN qz20sq89x8q 6 25 ) ) PUNCT qz20sq89x8q 6 26 is be AUX qz20sq89x8q 6 27 a a DET qz20sq89x8q 6 28 promising promising ADJ qz20sq89x8q 6 29 alternative alternative NOUN qz20sq89x8q 6 30 for for ADP qz20sq89x8q 6 31 exploratory exploratory ADJ qz20sq89x8q 6 32 regression regression NOUN qz20sq89x8q 6 33 analysis analysis NOUN qz20sq89x8q 6 34 because because SCONJ qz20sq89x8q 6 35 it it PRON qz20sq89x8q 6 36 builds build VERB qz20sq89x8q 6 37 an an DET qz20sq89x8q 6 38 interpretable interpretable ADJ qz20sq89x8q 6 39 model model NOUN qz20sq89x8q 6 40 that that PRON qz20sq89x8q 6 41 approximates approximate VERB qz20sq89x8q 6 42 nonlinear nonlinear ADJ qz20sq89x8q 6 43 effects effect NOUN qz20sq89x8q 6 44 and and CCONJ qz20sq89x8q 6 45 interactions interaction NOUN qz20sq89x8q 6 46 among among ADP qz20sq89x8q 6 47 predictors predictor NOUN qz20sq89x8q 6 48 without without ADP qz20sq89x8q 6 49 a a DET qz20sq89x8q 6 50 priori priori ADJ qz20sq89x8q 6 51 specification specification NOUN qz20sq89x8q 6 52 . . PUNCT qz20sq89x8q 7 1 however however ADV qz20sq89x8q 7 2 , , PUNCT qz20sq89x8q 7 3 it it PRON qz20sq89x8q 7 4 is be AUX qz20sq89x8q 7 5 not not PART qz20sq89x8q 7 6 clear clear ADJ qz20sq89x8q 7 7 how how SCONJ qz20sq89x8q 7 8 to to PART qz20sq89x8q 7 9 build build VERB qz20sq89x8q 7 10 and and CCONJ qz20sq89x8q 7 11 interpret interpret VERB qz20sq89x8q 7 12 gradient gradient NOUN qz20sq89x8q 7 13 tree tree NOUN qz20sq89x8q 7 14 boosting boost VERB qz20sq89x8q 7 15 models model NOUN qz20sq89x8q 7 16 in in ADP qz20sq89x8q 7 17 the the DET qz20sq89x8q 7 18 context context NOUN qz20sq89x8q 7 19 of of ADP qz20sq89x8q 7 20 multivariate multivariate NOUN qz20sq89x8q 7 21 , , PUNCT qz20sq89x8q 7 22 longitudinal longitudinal ADJ qz20sq89x8q 7 23 , , PUNCT qz20sq89x8q 7 24 and and CCONJ qz20sq89x8q 7 25 hierarchically hierarchically ADV qz20sq89x8q 7 26 clustered cluster VERB qz20sq89x8q 7 27 data datum NOUN qz20sq89x8q 7 28 commonly commonly ADV qz20sq89x8q 7 29 found find VERB qz20sq89x8q 7 30 in in ADP qz20sq89x8q 7 31 psychological psychological ADJ qz20sq89x8q 7 32 research.this research.this NOUN qz20sq89x8q 7 33 dissertation dissertation NOUN qz20sq89x8q 7 34 develops develop VERB qz20sq89x8q 7 35 two two NUM qz20sq89x8q 7 36 procedures procedure NOUN qz20sq89x8q 7 37 for for ADP qz20sq89x8q 7 38 estimating estimate VERB qz20sq89x8q 7 39 gradient gradient NOUN qz20sq89x8q 7 40 tree tree NOUN qz20sq89x8q 7 41 boosting boost VERB qz20sq89x8q 7 42 models model NOUN qz20sq89x8q 7 43 for for ADP qz20sq89x8q 7 44 multivariate multivariate NOUN qz20sq89x8q 7 45 , , PUNCT qz20sq89x8q 7 46 longitudinal longitudinal ADJ qz20sq89x8q 7 47 and and CCONJ qz20sq89x8q 7 48 hierarchically hierarchically ADV qz20sq89x8q 7 49 clustered cluster VERB qz20sq89x8q 7 50 data datum NOUN qz20sq89x8q 7 51 . . PUNCT qz20sq89x8q 8 1 multivariate multivariate NOUN qz20sq89x8q 8 2 tree tree NOUN qz20sq89x8q 8 3 boosting boost VERB qz20sq89x8q 8 4 selects select NOUN qz20sq89x8q 8 5 predictors predictor NOUN qz20sq89x8q 8 6 that that PRON qz20sq89x8q 8 7 explain explain VERB qz20sq89x8q 8 8 covariance covariance NOUN qz20sq89x8q 8 9 in in ADP qz20sq89x8q 8 10 multiple multiple ADJ qz20sq89x8q 8 11 outcomes outcome NOUN qz20sq89x8q 8 12 . . PUNCT qz20sq89x8q 9 1 mixed mixed ADJ qz20sq89x8q 9 2 effects effect NOUN qz20sq89x8q 9 3 tree tree NOUN qz20sq89x8q 9 4 boosting boosting NOUN qz20sq89x8q 9 5 takes take VERB qz20sq89x8q 9 6 hierarchically hierarchically ADV qz20sq89x8q 9 7 clustered cluster VERB qz20sq89x8q 9 8 data datum NOUN qz20sq89x8q 9 9 into into ADP qz20sq89x8q 9 10 account account NOUN qz20sq89x8q 9 11 by by ADP qz20sq89x8q 9 12 treating treat VERB qz20sq89x8q 9 13 a a DET qz20sq89x8q 9 14 grouping group VERB qz20sq89x8q 9 15 variable variable NOUN qz20sq89x8q 9 16 as as ADP qz20sq89x8q 9 17 random random ADJ qz20sq89x8q 9 18 . . PUNCT qz20sq89x8q 10 1 longitudinal longitudinal ADJ qz20sq89x8q 10 2 data datum NOUN qz20sq89x8q 10 3 can can AUX qz20sq89x8q 10 4 be be AUX qz20sq89x8q 10 5 modeled model VERB qz20sq89x8q 10 6 in in ADP qz20sq89x8q 10 7 boosted boosted ADJ qz20sq89x8q 10 8 decision decision NOUN qz20sq89x8q 10 9 trees tree NOUN qz20sq89x8q 10 10 by by ADP qz20sq89x8q 10 11 including include VERB qz20sq89x8q 10 12 time time NOUN qz20sq89x8q 10 13 as as ADP qz20sq89x8q 10 14 a a DET qz20sq89x8q 10 15 candidate candidate NOUN qz20sq89x8q 10 16 for for ADP qz20sq89x8q 10 17 splitting splitting NOUN qz20sq89x8q 10 18 in in ADP qz20sq89x8q 10 19 mixed mixed ADJ qz20sq89x8q 10 20 effects effect NOUN qz20sq89x8q 10 21 tree tree NOUN qz20sq89x8q 10 22 boosting boost VERB qz20sq89x8q 10 23 . . PUNCT qz20sq89x8q 11 1 these these DET qz20sq89x8q 11 2 procedures procedure NOUN qz20sq89x8q 11 3 are be AUX qz20sq89x8q 11 4 illustrated illustrate VERB qz20sq89x8q 11 5 by by ADP qz20sq89x8q 11 6 application application NOUN qz20sq89x8q 11 7 to to ADP qz20sq89x8q 11 8 real real ADJ qz20sq89x8q 11 9 data datum NOUN qz20sq89x8q 11 10 . . PUNCT qz20sq89x8q 12 1 simulations simulation NOUN qz20sq89x8q 12 2 demonstrate demonstrate VERB qz20sq89x8q 12 3 that that SCONJ qz20sq89x8q 12 4 the the DET qz20sq89x8q 12 5 methods method NOUN qz20sq89x8q 12 6 balance balance VERB qz20sq89x8q 12 7 true true ADJ qz20sq89x8q 12 8 and and CCONJ qz20sq89x8q 12 9 false false ADJ qz20sq89x8q 12 10 positive positive ADJ qz20sq89x8q 12 11 rates rate NOUN qz20sq89x8q 12 12 when when SCONJ qz20sq89x8q 12 13 selecting select VERB qz20sq89x8q 12 14 variables variable NOUN qz20sq89x8q 12 15 , , PUNCT qz20sq89x8q 12 16 and and CCONJ qz20sq89x8q 12 17 achieve achieve VERB qz20sq89x8q 12 18 low low ADJ qz20sq89x8q 12 19 prediction prediction NOUN qz20sq89x8q 12 20 error error NOUN qz20sq89x8q 12 21 at at ADP qz20sq89x8q 12 22 sample sample NOUN qz20sq89x8q 12 23 and and CCONJ qz20sq89x8q 12 24 effect effect NOUN qz20sq89x8q 12 25 sizes size NOUN qz20sq89x8q 12 26 commonly commonly ADV qz20sq89x8q 12 27 observed observe VERB qz20sq89x8q 12 28 in in ADP qz20sq89x8q 12 29 psychology psychology NOUN qz20sq89x8q 12 30 . . PUNCT