Skip to content

Latest commit

 

History

History
612 lines (547 loc) · 19 KB

File metadata and controls

612 lines (547 loc) · 19 KB

目录

针对的内容是接下来需要建立的统计的数据值信息:

  1. Professional: 专业状态,包括的信息内容为:

    • Student
    • Professional developer
    • Professional non-developer who sometimes writes code
    • Used to be a professional developer
    • None of these
    • NAN——2018 年数据值
  2. Employment:工作状态,2017 年的字段 EmploymentStatus 名称需要修正。包括的信息内容:

    • Employed full-time
    • Independent contractor, freelancer, or self-employed
    • Employed part-time
    • Not employed, and not looking for work
    • I prefer not to say
    • Retired
  3. University:2018 年的字段为 Student,名称需要修正。包括的信息内容:

    • No
    • Yes, full-time
    • Yes, part-time
    • I prefer not to say
  4. FormalEducation:正式受教育情况,注意 2018 年信息修改。包括的信息内容:

    • Bachelor's degree
    • Master's degree
    • Some college/university study without earning a bachelor's degree
    • Secondary school
    • Doctoral degree
    • I prefer not to answer
    • Primary/elementary school
    • Professional degree
    • I never completed any formal education
  5. UndergradMajor:大学专业,保留 2017 年和 2018 年数据。但是 2017 年字段名 MajorUndergrad ,要修改。包括信息内容:

    2017 年的信息

    • Computer science or software engineering
    • Computer engineering or electrical/electronics engineering
    • Computer programming or Web development
    • Information technology, networking, or system administration
    • A natural science
    • A non-computer-focused engineering discipline
    • Mathematics or statistics
    • Something else
    • A humanities discipline
    • A business discipline
    • Management information systems
    • Fine arts or performing arts
    • A social science
    • I never declared a major
    • Psychology
    • A health science

    2018 年信息

    • Computer science, computer engineering, or software engineering
    • Another engineering discipline (ex. civil, electrical, mechanical)
    • Information systems, information technology, or system administration
    • A natural science (ex. biology, chemistry, physics)
    • Mathematics or statistics
    • Web development or web design
    • A business discipline (ex. accounting, finance, marketing)
    • A humanities discipline (ex. literature, history, philosophy)
    • A social science (ex. anthropology, psychology, political science)
    • Fine arts or performing arts (ex. graphic design, music, studio art)
    • I never declared a major
    • A health science (ex. nursing, pharmacy, radiology)
  6. Gender:性别,字段信息:

    • Male
    • Female
    • NoComment——其他多选项信息
  7. SkipMeals:废寝忘食,字段信息

    • Never
    • 1 - 2 times per week
    • 3 - 4 times per week
    • Daily or almost every day
    • NAN——2017 年没有该数据
  8. Race:人种信息,2018 年字段 RaceEthnicity。对于多字段信息,采取了直接取得第一个分号前值。包括字段复选框字段:

    • Black or of African descent
    • East Asian
    • Hispanic or Latino/Latina
    • Middle Eastern
    • Native American, Pacific Islander, or Indigenous Australian
    • South Asian
    • White or of European descent
    • I don’t know ——只要包括这个数据,变更数据为 NoInfo
    • I prefer not to say ——只要包括这个数据,变更数据为 NoInfo
    • NoInfo——没有明确信息的
  9. Age:年龄段,2017 年无该字段,包括字段信息:

    • I prefer not to answer
    • Under 18 years old
    • 18 - 24 years old
    • 25 - 34 years old
    • 35 - 44 years old
    • 45 - 54 years old
    • 55 - 64 years old
    • 65 years or older
    • NAN——2017 年无数据
  10. Country:国家,包括的是国家信息

  11. Salary:薪资,2018 年为 ConvertedSalary 字段名需要修改,连续型数值信息

  12. Currency:薪资结算符号,2018 年字段名称为 CurrencySymbol

    • U.S. dollars
    • Euros
    • British pounds sterling
    • Japanese yen
    • Chinese yuan renminbi
    • Brazilian reais
    • Indian rupees
    • Mexican pesos
    • South African rands
    • Swedish kroner
    • Australian dollars
    • Canadian dollars
    • Singapore dollars
    • Russian rubles
    • Swiss francs
    • Polish złoty
    • Bitcoin
    • other currency:
  13. SalaryType:薪资结算类型,2017 年增加该字段信息值为 Yearly,包括字段信息:

    • Monthly
    • Yearly
    • Weekly
  14. CompanySize:公司规模,包括的字段信息

    1. Fewer than 10 employees
    2. 10 to 19 employees
    3. 20 to 99 employees
    4. 100 to 499 employees
    5. 500 to 999 employees
    6. 1,000 to 4,999 employees
    7. 5,000 to 9,999 employees
    8. 10,000 or more employees
    9. NoInfo——I don't know 信息 和 I prefer not to answer 信息替换为 NoInfo
  15. DeveloperType:开发类型,是以一个复选信息字段,可能包括的字段信息。对于 2017 年和 2018 年的信息存在表达不一致的情况

    2017年 的信息:
        'Web developer': 'Web developer',
        'Mobile developer': 'Mobile developer',
        'Desktop applications developer': 'Desktop or enterprise applications developer',
        'Other': 'Other',
        'Embedded applications/devices developer': 'Embedded applications or devices developer',
        'Developer with a statistics or mathematics background': 'Developer with a statistics or mathematics background',
        'Data scientist': 'Data scientist or machine learning specialist',
        'DevOps specialist': 'DevOps specialist',
        'Quality assurance engineer': 'QA or test developer',
        'Database administrator': 'Database administrator',
        'Graphics programming': 'Graphics programming',
        'Machine learning specialist': 'Data scientist or machine learning specialist',
        'Systems administrator': 'Systems administrator',
        'Graphic designer': 'Designer',
        'Systems administrator' : 'System administrator',
        'Developer with a statistics or mathematics background': 'Educator or academic researcher',
        'Graphics programming' : 'Game or graphics developer'
    
    2018 年信息
    Back-end developer                     
    Full-stack developer                           
    Mobile developer                                
    Student                                         
    Front-end developer                              
    Desktop or enterprise applications developer    
    Data scientist or machine learning specialist    
    Embedded applications or devices developer     
    QA or test developer                             
    Data or business analyst                          
    C-suite executive (CEO, CTO, etc.)              
    DevOps specialist                               
    Engineering manager                               
    Educator or academic researcher                 
    System administrator                             
    Game or graphics developer                        
    Designer                                          
    Product manager                                  
    Database administrator                           
    

    当前已确认信息:

    • Back-end developer
    • C-suite executive (CEO CTO etc.)
    • Data or business analyst
    • Data scientist or machine learning specialist
    • Database administrator
    • Designer
    • Desktop or enterprise applications developer
    • DevOps specialist
    • Educator or academic researcher
    • Embedded applications or devices developer
    • Engineering manager
    • Front-end developer
    • Full-stack developer
    • Game or graphics developer
    • Marketing or sales professional
    • Mobile developer
    • Other
    • Product manager
    • QA or test developer
    • Student
    • System administrator
    • Web developer
  16. JobSatisfaction:工作满意度,2017 年和 2018 年数据不一致,修改后保存信息:

    • Extremely dissatisfied : = 0 Moderately dissatisfied : <= 2 Slightly dissatisfied : <= 4 Neither satisfied nor dissatisfied : = 5 Slightly satisfied : <= 7 Moderately satisfied : <= 9 Extremely satisfied : = 10
  17. JobSeekingStatus:求职状态,2018 年的字段名称需要修改(JobSearchStatus)。字段信息如下:

    1. I am actively looking for a job
    2. I’m not actively looking, but I am open to new opportunities
    3. I am not interested in new job opportunities
  18. YearsCoding:编程年限,2017 年中包括了两个字段(YearsCodedJob 与 YearsCodedJobPast)说明了曾经或者过去的的编程年限。为与 2018 年的数据统一,将 2017 年的两个字段统一为一个 YearsCoding(从数据内容上来看,具有可行性—— YearsCodedJobPast 数据量少,可以统一到 YearsCodedJob 作补充数据, 以时间年限上最大的数据作为值。这样说明的是整体有多少编程年限)

    • 0-2 years
    • 3-5 years
    • 6-8 years
    • 9-11 years
    • 12-14 years
    • 15-17 years
    • 18-20 years
    • 20 or more years
  19. LanguageDesireNextYear :下一年中期望使用语言,在 2017 年中字段为 WantWorkLanguage,名称存在差异,此外值中存在多余的空格。字段数据值为:

  • Assembly
  • Bash/Shell
  • C
  • C#
  • C++
  • CSS
  • Clojure
  • Cobol
  • CoffeeScript
  • Common Lisp
  • Dart
  • Delphi/Object Pascal
  • Elixir
  • Erlang
  • F#
  • Go
  • Groovy
  • HTML
  • Hack
  • Haskell
  • Java
  • JavaScript
  • Julia
  • Kotlin
  • Lua
  • Matlab
  • Objective-C
  • Ocaml
  • PHP
  • Perl
  • Python
  • R
  • Ruby
  • Rust
  • SQL
  • Scala
  • Smalltalk
  • Swift
  • TypeScript
  • VB.NET
  • VBA
  • Visual Basic 6
  1. LanguageWorkedWith:目前正在使用的语言,在 2017 年中字段为 HaveWorkedLanguage,名称存在差异,此外值中存在多余的空格。字段数据值为:
  • Assembly
  • Bash/Shell
  • C
  • C#
  • C++
  • CSS
  • Clojure
  • Cobol
  • CoffeeScript
  • Common Lisp
  • Dart
  • Delphi/Object Pascal
  • Elixir
  • Erlang
  • F#
  • Go
  • Groovy
  • HTML
  • Hack
  • Haskell
  • Java
  • JavaScript
  • Julia
  • Kotlin
  • Lua
  • Matlab
  • Objective-C
  • Ocaml
  • PHP
  • Perl
  • Python
  • R
  • Ruby
  • Rust
  • SQL
  • Scala
  • Smalltalk
  • Swift
  • TypeScript
  • VB.NET
  • VBA
  • Visual Basic 6
  1. DatabaseDesireNextYear:下一年期待使用数据库,2017 年的名称为 WantWorkDatabase,字段名称不一致,数据中存在不必要空格。数据值为:
  • Amazon DynamoDB
  • Amazon RDS/Aurora
  • Amazon Redshift
  • Apache HBase
  • Apache Hive
  • Cassandra
  • Elasticsearch
  • Google BigQuery
  • Google Cloud Storage
  • IBM Db2
  • MariaDB
  • Memcached
  • Microsoft Azure (Tables, CosmosDB, SQL, etc)
  • MongoDB
  • MySQL
  • Neo4j
  • Oracle
  • PostgreSQL
  • Redis
  • SQL Server
  • SQLite
  1. DatabaseWorkedWith :当前正在使用的数据库,2017 年的名称为 HaveWorkedDatabase,字段名称不一致,数据中存在不必要空格。数据值为:
  • Amazon DynamoDB
  • Amazon RDS/Aurora
  • Amazon Redshift
  • Apache HBase
  • Apache Hive
  • Cassandra
  • Elasticsearch
  • Google BigQuery
  • Google Cloud Storage
  • IBM Db2
  • MariaDB
  • Memcached
  • Microsoft Azure (Tables, CosmosDB, SQL, etc)
  • MongoDB
  • MySQL
  • Neo4j
  • Oracle
  • PostgreSQL
  • Redis
  • SQL Server
  • SQLite
  1. PlatformWorkedWith :2017 年字段名称为HaveWorkedPlatform,字段名称不一致,此外存在多余空格。 数据信息为:
  • AWS
  • Amazon Echo
  • Amazon Web Services (AWS)
  • Android
  • Apple Watch or Apple TV
  • Arduino
  • Azure
  • Drupal
  • ESP8266
  • Firebase
  • Gaming console
  • Google Cloud Platform/App Engine
  • Google Home
  • Heroku
  • IBM Cloud or Watson
  • Linux
  • Linux Desktop
  • Mac OS
  • Mainframe
  • Microsoft Azure
  • Predix
  • Raspberry Pi
  • Salesforce
  • Serverless
  • SharePoint
  • Windows Desktop
  • Windows Desktop or Server
  • Windows Phone
  • WordPress
  • iOS
  1. PlatformDesireNextYear :2017 年字段名称为WantWorkPlatform,字段名称不一致,此外存在多余空格。 数据信息为:
  • AWS
  • Amazon Echo
  • Amazon Web Services (AWS)
  • Android
  • Apple Watch or Apple TV
  • Arduino
  • Azure
  • Drupal
  • ESP8266
  • Firebase
  • Gaming console
  • Google Cloud Platform/App Engine
  • Google Home
  • Heroku
  • IBM Cloud or Watson
  • Linux
  • Linux Desktop
  • Mac OS
  • Mainframe
  • Microsoft Azure
  • Predix
  • Raspberry Pi
  • Salesforce
  • Serverless
  • SharePoint
  • Windows Desktop
  • Windows Desktop or Server
  • Windows Phone
  • WordPress
  • iOS
  1. Hobby:2017 年中的信息在 ProgramHobby 中,对 Yes, I program as a hobby 和 Yes, both 提取数据为 Yes,其他选项数据处理为 No

    • Yes
    • No
  2. OpenSource:2017 年的信息在 ProgramHobby 中,对 Yes, I contribute to oopen source projects 和 Yes, both 提取数据为 Yes, 其他选项数据处理为 No

    • Yes
    • No
  3. WorkStart:2018 年没有该字段

    • 9:00 AM
    • 10:00 AM
    • 8:00 AM
    • 7:00 AM
    • 11:00 AM
    • 6:00 AM
    • Noon
    • 2:00 PM
    • 1:00 PM
    • 5:00 AM
    • 8:00 PM
    • 6:00 PM
    • 4:00 PM
    • 10:00 PM
    • 3:00 PM
    • 9:00 PM
    • 5:00 PM
    • 7:00 PM
    • 4:00 AM
    • Midnight
    • 11:00 PM
    • 1:00 AM
    • 3:00 AM
    • 2:00 AM
  4. UpdateCV:2017 年字段为 ResumePrompted,需要修改字段名。同时 2017 年和 2018 年字段的值差异太大需要确认信息

    2017 年
    I was just giving it a regular update
    A recruiter contacted me
    Something else
    I saw an employer's advertisement
    I completed a major project, assignment, or contract
    A friend told me about a job opportunity
    I received bad news about the future of my company or depart
    I received negative feedback on my job performance 
    
    2018 年
    My job status or other personal status changed
    A recruiter contacted me
    I had a negative experience or interaction at work
    A friend told me about a job opportunity
    I saw an employer’s advertisement
    I did not receive an expected change in compensation
    I received bad news about the future of my company or department
    I received negative feedback on my job performance
    
    • A friend told me about a job opportunity
    • A recruiter contacted me
    • I did not receive an expected change in compensation
    • I had a negative experience or interaction at work
    • I received bad news about the future of my company or department
    • I received negative feedback on my job performance
    • I saw an employer’s advertisement
    • I was just giving it a regular update
    • My job status or other personal status changed
    • Something else
  5. MetricAssess:2018 年没有该字段

    • Benchmarked product performance,
    • Bugs found,
    • Commit frequency,
    • Customer satisfaction,
    • Hours worked,
    • Lines of code,
    • Manager's rating,
    • On time/in budget,
    • Other,
    • Peer's rating,
    • Release frequency,
    • Revenue performance,
    • Self-rating
  6. LastNewJob: 2017 年中字段值 Not applicable/ never 修改为 I've never had a job

    • Less than a year ago

    • Between 1 and 2 years ago

    • Between 2 and 4 years ago

    • More than 4 years ago

    • I've never had a job

  7. SelfTaughtTypes:2018 年和 2017 年字段的值差别较大:

    2018 年字段值
    self_taught = {
        'A book or e-book from O’Reilly, Apress, or a similar publisher' : "Textbook",
        'A college/university computer science or software engineering book' : 'Trade book',
        'Internal Wikis, chat rooms, or documentation set up by my company for employees' :'Company internal community',
        'Online developer communities other than Stack Overflow (ex. forums, listservs, IRC channels, etc.)' : 'Non-Stack online communities',
        'Pre-scheduled tutoring or mentoring sessions with a friend or colleague': 'Tutoring/mentoring',
        'Questions & answers on Stack Overflow':'Stack Overflow Q&A',
        'Tapping your network of friends, family, and peers versed in the technology' : 'Friends network',
        'The official documentation and/or standards for the technology' : 'Official documentation',
        'The technology’s online help system' : 'Built-in help',
    }
    
    • Built-in help
    • Company internal community
    • Friends network
    • Non-Stack online communities
    • Official documentation
    • Other
    • Stack Overflow Docs
    • Stack Overflow Q&A
    • Textbook
    • Trade book
    • Tutoring/mentoring'
  8. TimeAfterBootcamp:2017 年的字段值有差异,需要修改。最终得到的结果如下:

    • Four to six months,
    • I already had a full-time job as a developer when I began the program,
    • I got a job as a developer before completing the program,
    • I haven’t gotten a developer job,
    • Immediately after graduating,
    • Less than a month,
    • Longer than a year,
    • One to three months,
    • Six months to a year
  9. EducationTypes:2017 年和 2018 年字段值存在差异需要确认修改,修改 2018 年字段值

    2018 年字段值
    education_types = {
        'Completed an industry certification program (e.g. MCPD)':'Industry certification',
        'Contributed to open source software' :'Open source contributions',
        'Participated in a full-time developer training program or bootcamp' : 'Bootcamp',
        'Participated in a hackathon' : 'Hackathon',
        'Participated in online coding competitions (e.g. HackerRank, CodeChef, TopCoder)' : 'Coding competition',
        'Received on-the-job training in software development' : 'On-the-job training',
        'Taken a part-time in-person course in programming or software development' : 'Part-time/evening course',
        'Taken an online course in programming or software development (e.g. a MOOC)' : 'Online course',
        'Taught yourself a new language, framework, or tool without taking a formal course' : 'Self-taught',
    }
    
    • Bootcamp,
    • Coding competition,
    • Hackathon,
    • Industry certification,
    • On-the-job training,
    • Online course,
    • Open source contributions,
    • Part-time/evening course,
    • Self-taught