Dear All, I have this dataset in PDF format (the files are here Array , Array ) as follows. Array
Array

I copy the data, and past it into excel file (Please let me know if you need this file), and import it into Stata
Code:
* Example generated by -dataex-. To install: ssc install dataex
clear
input str97 A
"1 财经研究6109 3.135 3.121 0.315 1659 1.276 1.263 0.248 1.219 1.205 0.235 1"                  
"2 经济理论与经济管理4721 2.272 2.248 0.307 1169 0.957 0.933 0.240 0.896 0.872 0.233 1"   
"3 财经科学4510 2.179 2.165 0.270 1085 0.830 0.816 0.164 0.774 0.759 0.164 1"                  
"4 经济学家3719 2.080 2.046 0.231 1074 0.966 0.933 0.181 0.924 0.891 0.176 1"                  
"5 财经问题研究5293 1.965 1.921 0.149 1287 0.778 0.734 0.100 0.723 0.679 0.100 1"            
"6 当代经济科学3030 1.944 1.915 0.188 717 0.831 0.803 0.168 0.770 0.742 0.149 1"             
"7 经济评论4053 1.944 1.924 0.297 851 0.738 0.718 0.188 0.711 0.691 0.172 1"                   
"8 宏观经济研究2851 1.915 1.906 0.396 819 0.848 0.839 0.287 0.763 0.754 0.256 2"             
"9 上海财经大学学报1267 1.880 1.861 0.132 370 0.848 0.829 0.118 0.823 0.804 0.118 1"       
"10 东南学术# 172 1.843 0.200 60"                                                              
"11 产业经济研究1102 1.783 1.783 0.082 306 0.659 0.659 0.055 0.620 0.620 0.041 1"            
"12 当代财经6344 1.774 1.738 0.222 1456 0.670 0.633 0.158 0.643 0.607 0.158 1"                 
"13 经济与管理研究2497 1.769 1.756 0.202 680 0.716 0.704 0.165 0.678 0.666 0.157 1"         
"14 学术月刊# 154 1.644 0.200 69"                                                              
"15 中央财经大学学报3798 1.605 1.584 0.170 858 0.600 0.579 0.117 0.561 0.540 0.117 1"      
"16 山西财经大学学报3298 1.594 1.573 0.138 911 0.706 0.686 0.133 0.649 0.629 0.128 1"      
"17 广东社会科学# 156 1.554 0.271 60"                                                        
"18 社会科学# 184 1.540 0.182 77"                                                              
"19 预测3731 1.521 1.479 0.073 855 0.647 0.605 0.061 0.509 0.467 0.061 1"                        
"20 经济社会体制比较3681 1.501 1.475 0.118 1008 0.642 0.616 0.084 0.604 0.578 0.084 1"     
"21 学术研究# 148 1.500 0.140 52"                                                              
"22 重庆大学学报(社会科学版)# 191 1.465 0.034 74"                                       
"23 中国地质大学学报(社会科学版)# 132 1.453 0.156 49"                                 
"24 北京邮电大学学报(社会科学版)# 126 1.440 0.098 48"                                 
"25 经济问题探索5170 1.433 1.406 0.138 1542 0.639 0.612 0.116 0.558 0.530 0.111 1"           
"26 山东大学学报(哲学社会科学版)# 133 1.398 0.217 55"                                 
"27 浙江社会科学# 130 1.371 0.190 55"                                                        
"28 甘肃社会科学# 185 1.341 0.119 94"                                                        
"29 西安交通大学学报(社会科学版)# 112 1.338 0.225 50"                                 
"30 中南财经政法大学学报2643 1.336 1.326 0.121 602 0.485 0.476 0.067 0.450 0.440 0.067 1"
end
Of course, I used -split- command to split the variable, but found the some of the data are in wrong columns, likely due to missing data in the raw data. I wonder if anyone can give some suggestions to this diffcult question (because we have LOTS of data like this kind). Thanks in advance.