Skip to content

Commit ff3e67f

Browse files
Update readme.md
1 parent 887cbb7 commit ff3e67f

File tree

1 file changed

+26
-78
lines changed

1 file changed

+26
-78
lines changed

readme.md

Lines changed: 26 additions & 78 deletions
Original file line numberDiff line numberDiff line change
@@ -1,121 +1,69 @@
1-
<img src="https://stackoverflow.design/assets/img/logos/so/logo-stackoverflow.png" align="left" height="100" width="450" >
2-
<br>
3-
<br>
4-
<br>
5-
<br>
6-
<br>
7-
<br>
1+
# Stack Overflow Analysis Guidelines
82

9-
[![MIT License](https://img.shields.io/apm/l/atomic-design-ui.svg?)](https://github.com/tterb/atomic-design-ui/blob/master/LICENSEs)
10-
<img src="https://img.shields.io/github/last-commit/Sanjayviswa/Stackoverflow-Analysis">
11-
<img src="https://img.shields.io/github/languages/code-size/Sanjayviswa/Stackoverflow-Analysis">
3+
## 👨‍💻 Demo Video
124

13-
# Stackoverflow Analysis Guidelines
14-
## 👨‍💻 Demo video
15-
16-
17-
https://user-images.githubusercontent.com/30715153/168960157-e9448ea4-206c-44c0-bbd5-5e4770c0411f.mp4
18-
19-
You can start working with repo with simple changes and i have updated couple of issue .
20-
Check out the Main Code: [Stackoverflow-Analysis](https://github.com/sanjay-kv/Stackoverflow-Analysis/blob/main/Stackoverflow_Survey_Analysis.ipynb)
5+
[Watch the demo video](https://user-images.githubusercontent.com/30715153/168960157-e9448ea4-206c-44c0-bbd5-5e4770c0411f.mp4)
216

227
## 👇 Prerequisites
238

24-
Before installation, please make sure you have already installed the following tools:
9+
Before installation, please ensure you have the following tools installed:
2510

26-
27-
- [Git](https://git-scm.com/downloads) Learn Git step-by-step by following the instructions provided [here](https://recodehive.com/how-to-install-git-git-tutorial/).
28-
- [Git](https://git-scm.com/downloads) Learn Step by Step instruction here.
29-
- [recodehive](https://recodehive.com/how-to-install-git-git-tutorial/)
11+
- [Git](https://git-scm.com/downloads) - Learn Git step-by-step by following the instructions provided [here](https://recodehive.com/how-to-install-git-git-tutorial/).
3012
- [Anaconda](https://anaconda.org/anaconda)
3113
- [Jupyter Package](https://anaconda.org/anaconda/jupyter)
3214

3315
## 🛠️ Installation Steps
3416

35-
1. Fork the project
36-
Fork the sanjay-kv/Stackoverflow-Analysis/ repository
37-
Follow these instructions on [how to fork a repository](https://help.github.com/en/articles/fork-a-repo)
38-
2. Clone the project
39-
git clone [email protected]:your-username/Stackoverflow-Analysis.git
40-
3. Download the orginal data from the [drive link](https://drive.google.com/drive/folders/13W20DfCW2W5GEeKTYTl7R6xV5hmPS2Do?usp=sharing)
41-
4. Open Jupyter Notebook and place the file in the project folder *Make sure you selecting the correct path*
17+
1. **Fork the project**: Fork the `sanjay-kv/Stackoverflow-Analysis` repository. Follow these instructions on [how to fork a repository](https://help.github.com/en/articles/fork-a-repo).
18+
2. **Clone the project**: `git clone [email protected]:your-username/Stackoverflow-Analysis.git`
19+
3. **Download the original data** from the [drive link](https://drive.google.com/drive/folders/13W20DfCW2W5GEeKTYTl7R6xV5hmPS2Do?usp=sharing)
20+
4. **Open Jupyter Notebook** and place the file in the project folder. Make sure you're selecting the correct path.
4221

4322
## Development
4423

45-
We love your desire to give back, and want to make the process as welcoming to newcomers and experts as possible. We're working on developing more intuitive tutorials for individuals of all skill levels and expertise, so if you think the community would value from being walked through the steps you're going through please share! ❤️
46-
47-
## Finding Insights from Stackoverflow Developer Survey
48-
49-
Stack overflow is a professional community for developers, Stackoverflow conducts a survey every year the collected data from 2011 has been available for open source on the web with the latest dataset 2020 released on March 5th, 2021. If the dataset analysed professionally using modern tools, would enable us to answer real-world questions effectively. The dataset has covered 275 questions in total.
50-
51-
### Project Goal:
52-
53-
1. To perform Analysis on 3 years Stackoverflow Dataset and get insights.
54-
2. To perform Data Analysis and answer the below questions.
55-
+ Impact of higher education on salary of the surveyed developers.
56-
+ Impact of education/experience/responsibilities on gender inequalities.
57-
+ Impact on participation rate due to different ethnicity.
58-
+ To find whether there is any difference between men and women's income.
59-
+ Impact on the increase in popularity of a language in the current year due to developer's interest in the previous year.
60-
61-
3. To perform data visualization on
62-
63-
- The most commonly used language.
24+
We welcome contributions from all levels of experience. If you think the community would benefit from being walked through the steps you're going through, please share! ❤️
6425

65-
- Distribution of surveyors based on their developer role.
26+
## Finding Insights from Stack Overflow Developer Survey
6627

67-
- Factors affecting Job satisfaction.
28+
Stack Overflow is a professional community for developers, conducting a survey annually. Analyzing the dataset professionally using modern tools can enable us to answer real-world questions effectively. The dataset covers 275 questions in total.
6829

69-
- Predicting the growth of languages for upcoming years based on the survey answers.
30+
### Project Goals:
7031

71-
###### The Insights can be used to provide information regarding IT environment, hiring employees and job seekers and build a solid résumé.
32+
1. Perform Analysis on the last 3 years' Stack Overflow Dataset to extract insights.
33+
2. Analyze the impact of higher education, experience, and responsibilities on salary and gender inequalities.
34+
3. Investigate participation rates based on ethnicity and differences in income between men and women.
35+
4. Explore the popularity of programming languages and predict their growth based on survey responses.
7236

7337
### Data Source and Background
7438

75-
https://user-images.githubusercontent.com/30715153/169042852-150e59cf-b742-40bb-bcbd-c34a330c1518.mp4
76-
77-
78-
The dataset is very diverse and came from a [Stackoverflow developer survey](https://insights.stackoverflow.com/survey/?_ga=2.208907280.304952146.1616422967-1864686930.1616422967) with 275 questions answered from 180 countries. Stackoverflow has data collected through surveys from 2011 to 2020, but for the project, the purpose is to analyze the data of the last 3 years. The people who completed the survey mostly from the US, India, and EMEA regions. The majority of the survey respondents had the background of developer/ coding experience. The data are available in the CSV format ranging from 40 to 150 MB with data of 1.5 Lakh survey participants.The dataset includes survey data gathered from 180 countries, the response ranges from Not at all important to very important/ Not at all satisfied to very satisfied.
39+
The dataset comes from the annual Stack Overflow developer survey, covering responses from developers in 180 countries. The data are available in CSV format, ranging from 40 to 150 MB, with responses from 1.5 Lakh survey participants.
7940

8041
### Data Format
8142

82-
The data is in a schema CSV file that consists of 252,199 observations and 62 variables.
83-
84-
### Projected work needs to be done for Insights.
85-
86-
###### Data Wrangling
87-
88-
**Dealing Null Values**: As this is a developer survey and few questions left unanswered by the respondents as '*NA*' or '*Not Applicable*' so dealing with null values is important to get precise information. Data conversion/ manipulation is also required, as the developer responded to the survey through radio buttons rather than yes or no pattern(Univariate analysis).
43+
The data is in a CSV file format with 252,199 observations and 62 variables.
8944

90-
###### Techniques expect to use in the project
45+
### Expected Work
9146

92-
Planning to use ML Algorithms like Random, may include, KNN, AUC for classification problems, training model, logistic regression,data visualization, parameter analysis, Linear Regreesion, Root Mean square.
47+
Data wrangling tasks include handling null values and converting data for analysis. Techniques such as ML algorithms and data visualization will be employed.
9348

9449
## 👨‍💻 Contributing
9550

96-
- Contributions make the open source community such an amazing place to learn, inspire, and create.
97-
- Any contributions you make are **greatly appreciated**.
98-
- Check out our contribution guidelines(yet to update) for more information.
51+
- Contributions are greatly appreciated. Check out our contribution guidelines (yet to be updated) for more information.
9952

10053
## 🛡️ License
10154

102-
LinkFree is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
55+
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
10356

10457
## 💪 Thanks to all Contributors
10558

106-
Thanks a lot for spending your time helping this project grow. Thanks a lot! Keep rocking 🍻
59+
Thanks to all contributors for helping this project grow! 🍻
10760

10861
<a href="https://github.com/sanjay-kv/Stackoverflow-Analysis/graphs/contributors">
10962
<img src="https://contrib.rocks/image?repo=sanjay-kv/Stackoverflow-Analysis" />
11063
</a>
11164

11265
## 🙏 Support
11366

114-
This project needs a ⭐️ from you. Don't forget to leave a star ⭐️
115-
116-
117-
118-
119-
This repo is crafted with ♥ and owned/maintained by @sanjay-kv
120-
67+
Don't forget to leave a star ⭐️ for this project!
12168

69+
Crafted with ♥ by @sanjay-kv.

0 commit comments

Comments
 (0)