WHO-BCN Bulk-Load sheet generation scripts by nshandra · Pull Request #2 · EyeSeeTea/Bulk-Load-pytools

nshandra · 2023-06-06T10:27:10Z

📌 References

Issues:
- Review R script used to generate Excels to see if we can adapt it to our DHIS2 Templates
- Script to convert qualitative data docs into excels

📝 Implementation

Imported the existing script from the previous repository
Added currency adjustment option to make_quantitative_bulk_load_file.py

reviewpad · 2023-06-06T10:28:26Z

Thank you @nshandra for this first contribution!

reviewpad · 2023-06-07T11:07:04Z

AI-Generated Summary: This pull request introduces two new scripts: make_quantitative_bulk_load_file.py for processing CSV files from the "Data Extraction Tool" into "Bulk Load" Excel files (XLSX), and make_qualitative_bulk_load_file.py for automating the conversion of data from DOCX files into "Bulk Load" XLSX files used for qualitative data reviews. Both scripts accept various command-line options, including template files and debugging flags.

Additionally, new requirements.txt files are added to both the quantitative_data_script and qualitative_data_script directories to manage dependencies, .vscode is added to the .gitignore file, and the README.md file is significantly updated with installation instructions, examples, and script descriptions.

…essage.

Add option parsing. Add currency_converter and get_csv_indicator_value functions. Update README

reviewpad · 2023-09-11T10:50:14Z

AI-Generated Summary: This pull request introduces various changes across different aspects of the project:

A new file "requirements.txt" has been added to both the directories "WHO-BCN-data_scripts/qualitative_data_script" and "quantitative_data_script". These files contain the version specifications for the required python packages.
The README.md file has been updated to provide a comprehensive overview of the 'Bulk Load Pytools' Python tools. It now explains the installation of dependencies, as well as the usage and examples of the 'make_quantitative_bulk_load_file.py' and 'make_qualitative_bulk_load_file.py' scripts.
A new python script 'make_qualitative_bulk_load_file.py' has been introduced. It handles the process of turning DOCX files into "Bulk Load" XLSX files with quite a lot of functionality, including error checking, debugging, and logging features.
There has been a significant update to a script used for processing data from CSV files produced by the "Data Extraction Tool" and loading it into a Bulk Load XLSX file. This script uses a variety of imported modules, dictionaries, lists, and functions to conduct its tasks along with exception handling and logging mechanisms.
Lastly, the ".gitignore" file has been updated to ignore the ".vscode" directory, ensuring that local VS Code settings are not tracked or shared via the repository.

reviewpad · 2023-09-19T08:10:07Z

AI-Generated Summary: This pull request includes significant updates to the project documentation and introduces several Python scripts, along with other auxiliary files. The README.md file was updated with comprehensive documentation that outlines the installation and usage instructions for the scripts.

Four new Python scripts have been introduced, each with its unique functionality:

Two of these scripts are used for processing CSV and DOCX files, respectively, converting them into a specific format (.xlsx files in this case). They employ multiple standard and third-party Python libraries for their operations. Both scripts handle command-line arguments that define the processing parameters. These scripts are accompanied by two new requirements.txt files located in their respective directories, specifying the Python package dependencies.
The other two scripts seem to be used for data extraction, conversion, matching, and writing.

The .gitignore file has been updated to exclude '.vscode' directory, which means that the changes in Visual Studio Code settings won't be tracked or committed anymore.

Given the size and complexity of the newly created files, it's recommended to conduct a comprehensive code review, which may require several iterations and potentially some testing.

reviewpad · 2023-09-21T11:04:11Z

AI-Generated Summary: This pull request introduces several significant changes involving the addition of new scripts and updates to project documentation and configuration. The added scripts include make_quantitative_bulk_load_file.py and make_qualitative_bulk_load_file.py, both of which are responsible for handling and processing data from various file types. The quantitative script processes CSV files from a Data Extraction Tool into "Bulk Load" XLSX files, adjusting values based on command-line flags and utilizing online currency conversion. The qualitative script, on the other hand, processes DOCX files into Excel files, with functions for extracting specific data structures and handling command-line arguments.

Two new requirements.txt files were added in 'WHO-BCN-data_scripts/quantitative_data_script' and 'WHO-BCN-data-scripts\qualitative_data_script' directories, specifying dependencies such as 'openpyxl', 'pandas', and 'python_docx' at precise versions. The .gitignore file was updated to ignore editor-specific settings from VS Code. Additionally, the README.md file was substantially updated with detailed descriptions of the newly added scripts and installation instructions, significantly enriching the documentation and making it more informative for users.

…dataElement match (to deal with double spaces and such)

…YYYY-MM-DD)" added to the template name.

reviewpad · 2023-09-25T08:00:47Z

AI-Generated Summary: This pull request introduces several changes primarily focused on data transformation and dependency management.

A new Python file (make_quantitative_bulk_load_file.py) has been added to the WHO-BCN-data_scripts/quantitative_data_script/ directory. This script automates the complex process of transforming CSV data into an XLSX file based on certain rules and adjustments.

The .gitignore file now includes settings ensuring user-specific VS Code configurations are not disturbed.

Additionally, new requirements.txt were created in quantitative_data_script and qualitative_data_script directories respectively. These files specify necessary Python package dependencies for the project.

The README.md documentation has been significantly enhanced with comprehensive details and instructions for using the new Python scripts, installation process, and usage guidance involving various arguments and options.

Finally, a Python script for processing DOCX files into XLSX format was added. This script extracts various data from a given DOCX file and writes it into an Excel sheet. Helper functions are utilized throughout the data extraction and writing process, enhancing the script's efficiency and functionality. Command-line arguments, error handling, and logging have also been addressed in the script.

Overall, the pull request significantly enhances data management, extraction, and transformation with new scripts and appropriate changes.

If quintile is Total and no service use default combo. Add a list of DEs ignoring the quintile to determine the combo. Added a count message detailing the number of entries matched from the CSV and the XLSX.

reviewpad · 2023-09-25T12:06:02Z

AI-Generated Summary: This pull request includes several updates and additions related to the 'Bulk Load Pytools' project. New requirements.txt files have been added in both the WHO-BCN-data_scripts/qualitative_data_script and WHO-BCN-data_scripts/quantitative_data_script directories, specifying dependencies for the newly added Python scripts. The .gitignore file has been updated to exclude the Visual Studio Code settings folder. Two new Python scripts, make_quantitative_bulk_load_file.py and make_qualitative_bulk_load_file.py, have been added. These scripts process CSV files and DOCX files, respectively, into bulk load XLSX files with facilities to handle various special cases. The README.md file has been substantially updated to enhance the documentation of the 'Bulk Load Pytools' project which includes installation instruction, instructions on usage of the new scripts, debugging options, and other useful information.

… template.

…il the google sheet is updated

reviewpad · 2023-12-07T10:08:54Z

AI-Generated Summary: This pull request introduces a python script that converts .docx files to an Excel workbook, designed specifically for health care policy documentation conversion. The script parses the input arguments, verifies the files, extracts necessary information, validates it, and writes it into the Excel workbook. Numerous helper functions are included for smoother operations, adhering to a defined .docx table format.

The diff also includes updates and additions to requirements.txt in the WHO-BCN-data_scripts/qualitative_data_script and WHO-BCN-data_scripts/quantitative_data_script directories, specifying necessary packages and versions like openpyxl, python_docx, and pandas.

Edits to '.gitignore' were made to prevent tracking changes for the .vscode directory, and a potential need for a newline character at the end of the file was flagged.

Lastly, significant expansions were made in the README file for the "Bulk Load Pytools" project, offering comprehensive execution instructions, improved header formatting, and additional resources.

reviewpad · 2023-12-11T12:04:32Z

AI-Generated Summary: This pull request includes an update to the .gitignore file to exclude Visual Studio Code settings and introduces a "requirements.txt" file in the directories "WHO-BCN-data_scripts/qualitative_data_script" and "WHO-BCN-data_scripts/quantitative_data_script" to specify python package dependencies. There is also a new Python script named make_qualitative_bulk_load_file.py added to the directory WHO-BCN-data_scripts/qualitative_data_script/ for processing DOCX files into "Bulk Load" XLSX files. Substantial enhancements have been made to the project documentation in the README.md file, detailing the installation, usage instructions, and providing examples for the "Bulk Load Pytools" project. These changes also include correcting the project's title and referencing two python scripts (make_quantitative_bulk_load_file.py and make_qualitative_bulk_load_file.py). These scripts are integral to the project's utility as they process various file types into "Bulk Load" XLSX files. Lastly, this pull request involves the active use of helper functions and the main function in association with command-line arguments in the newly added python script, which ultimately contributes to writing data updates to the XLSX files.

…heckbox DEs from id list to avoid incorrect assignment.

…ption for now.

… multiple countries

…e_update Feature: Qualitative template update

…e_update Updates 2024 & 2025

…ate metadata tab

…cators that need to duplicate the total value.

…tal_value Store the total value for by consumption quintile indicators in 'Total' combo

…ter_values Fix: Handle empty currency converter values

…ntry codes Note: old non-standard entries kept to keep backwards compatibility

[Feature] Add code based name mapping

nshandra added 3 commits June 6, 2023 12:14

feat(quantitative): Import make_quantitative_bulk_load_file.py script.

3f870bf

feat(qualitative): Import make_qualitative_bulk_load_file.py script.

cb1eb4b

chore: Apply pep 8 style.

77b1096

nshandra requested review from ifoche and saragilcas June 6, 2023 10:27

nshandra self-assigned this Jun 6, 2023

reviewpad bot added the large Pull request is large label Jun 6, 2023

feat(qualitative): Ignore fields which names start with "Internal"

2c5d62c

nshandra added 3 commits September 8, 2023 12:48

fix: Fix Czech Republic code, fix extract_values_from_csv exception m…

8088629

…essage.

feat(currency): Add currency calculation option to script.

1b14e15

Add option parsing. Add currency_converter and get_csv_indicator_value functions. Update README

feat(currency): Update requirements.txt

84876f7

nshandra added 5 commits September 13, 2023 12:16

fix(temp): update hard-coded category option combo default ID

03a421d

feat: Add better debug output

0a950f1

feat(qualitative): Add better error control to get_country_and_year

4266c97

feat(qualitative): Add extract_charges_in_coverage

8df7a7f

feat(qualitative): Add extract_user_charges_by_type_table

ca87513

nshandra added 2 commits September 21, 2023 12:55

fix(qualitative): Avoid storing empty tables data

e8cd265

fix(qualitative): Add cleanup_string function

4873aa1

nshandra added 2 commits September 21, 2023 15:33

feat(qualitative): Add better errors messages, add string cleanup to …

5a98e42

…dataElement match (to deal with double spaces and such)

fix(qualitative): Add special condition for Date updated DE having "(…

694a04d

…YYYY-MM-DD)" added to the template name.

feat(quantitative): Adjust the script to the existing metadata.

5e9cadc

If quintile is Total and no service use default combo. Add a list of DEs ignoring the quintile to determine the combo. Added a count message detailing the number of entries matched from the CSV and the XLSX.

nshandra added 2 commits September 25, 2023 17:15

feat(quantitative): Get the category option combo default ID from the…

6c3cf92

… template.

fix(temp): Added temporal fix to make_quantitative_bulk_load_file unt…

dd83ec8

…il the google sheet is updated

fix: check for empty lines and malformed lines in fix_references_format

d0bebf5

fix: update Netherlands country code

1982e2d

nshandra marked this pull request as ready for review April 8, 2024 09:52

nshandra requested review from Ramon-Jimenez and removed request for saragilcas July 4, 2024 08:43

nshandra and others added 20 commits July 4, 2024 12:41

fix: typos in README

00ab903

feat: add include internal option, update hard-coded fields, remove c…

75879c2

…heckbox DEs from id list to avoid incorrect assignment.

feat: update README

66b3e79

fix: remove verbose message, fix extract_user_charges_by_type_table

7c70e7b

feat: add Date updated (YYYY-MM-DD) checker and rudimentary fixer

15d7c92

feat: add check_date_updated_format function. Remove the --internal o…

936183f

…ption for now.

fix: write_indicator was applying the incorrect offset when importing…

982abea

… multiple countries

Merge pull request #4 from EyeSeeTea/feature/uhcw_qualitative_templat…

5b5dc18

…e_update Feature: Qualitative template update

make 20 default value for max coverage policy changes per year

3ae0d33

Merge pull request #5 from EyeSeeTea/feature/uhcw_qualitative_templat…

0fa2f4d

…e_update Updates 2024 & 2025

feat: switch hard coded COMBO_LIST to creating it from the xlsx_templ…

ef1e1ff

…ate metadata tab

feat: add find_total_quintile_indicator function to identify the indi…

f6c36f3

…cators that need to duplicate the total value.

feat: add insert_value_if_valid, copy total value to matching quintile

fd7242f

Merge pull request #7 from EyeSeeTea/feat/add_consumption_quintile_to…

0a3c551

…tal_value Store the total value for by consumption quintile indicators in 'Total' combo

feat: print error messages to stderr

7c24446

feat: improve error control for the coefficient retrieval

5a1a533

Merge pull request #8 from EyeSeeTea/fix/handle_empty_currency_conver…

bbabf8d

…ter_values Fix: Handle empty currency converter values

update Contry Dict with BIH sub-entities and KOS

6d72715

feat: add EU org unit and entries for countries with non-standard cou…

a195aa9

…ntry codes Note: old non-standard entries kept to keep backwards compatibility

feat: add indicator_id (DE code) matching via dict to the name mapping

b7fdaea

nshandra force-pushed the who_bcn_data_scripts branch from e764207 to 6d72715 Compare February 24, 2026 09:24

Ramon-Jimenez and others added 2 commits February 25, 2026 10:37

Merge pull request #9 from EyeSeeTea/feat/add_code_dict_name_map

f7b3766

[Feature] Add code based name mapping

feat: add two new DEs

0d82022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WHO-BCN Bulk-Load sheet generation scripts#2

WHO-BCN Bulk-Load sheet generation scripts#2
nshandra wants to merge 55 commits intodevelopmentfrom
who_bcn_data_scripts

nshandra commented Jun 6, 2023 •

edited

Loading

Uh oh!

reviewpad bot commented Jun 6, 2023

Uh oh!

reviewpad bot commented Jun 7, 2023

Uh oh!

reviewpad bot commented Sep 11, 2023

Uh oh!

reviewpad bot commented Sep 19, 2023

Uh oh!

reviewpad bot commented Sep 21, 2023

Uh oh!

reviewpad bot commented Sep 25, 2023

Uh oh!

reviewpad bot commented Sep 25, 2023

Uh oh!

reviewpad bot commented Dec 7, 2023

Uh oh!

reviewpad bot commented Dec 11, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nshandra commented Jun 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📌 References

📝 Implementation

Uh oh!

reviewpad bot commented Jun 6, 2023

Uh oh!

reviewpad bot commented Jun 7, 2023

Uh oh!

reviewpad bot commented Sep 11, 2023

Uh oh!

reviewpad bot commented Sep 19, 2023

Uh oh!

reviewpad bot commented Sep 21, 2023

Uh oh!

reviewpad bot commented Sep 25, 2023

Uh oh!

reviewpad bot commented Sep 25, 2023

Uh oh!

reviewpad bot commented Dec 7, 2023

Uh oh!

reviewpad bot commented Dec 11, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nshandra commented Jun 6, 2023 •

edited

Loading