Parse and Normalize API

Table of Contents

Summary
Request structure
Response structure
Versioning

Summary

This service is an enhanced version of our Resume Parsing API that parses a Base64-encoded resume and further enriches the parsed data by issuing calls to several other classification APIs. Specifically, the service currently makes one call to our Geography API to normalize the candidate's location information; one call to the Skills API to parse skills from the complete text of the resume; and one call per work history item to the Job Level, Job Title, [Company Normalization] (https://github.com/cbdr/DataScienceAPIDocumentation/blob/master/CompanyNormalization.md), and Skills APIs to enrich each work history accordingly.

These calls can be enabled or disabled as desired using the desired_enrichments parameter; for example, a client who only needs Job Title and Skills classifications could structure a Parse and Normalize request that would only retrieve these classifications. This parameter is documented in further detail below.

The service is located at https://api.careerbuilder.com/core/parsing/normalizedresume. As usual, you will need OAuth core credentials to use this service. (If you do not have these, please go here or email PlatformSoftware@careerbuilder.com to request core credentials.)

Request Structure

This service supports the HTTP GET and POST methods. Because Base64-encoded documents can be quite large, POST is encouraged for production use.

The following parameters may be supplied in the query string (for HTTP GET) or form body (for HTTP POST):

document (Required) -- A .doc, .docx, .pdf, .rtf, .txt, .odt, .wps, and .pages documents given in a BASE64 encoded string. Please note that the .pages format is not accepted by Textkernel; you will need to specify another parser.
desired_enrichments (Required) -- A comma-separated list of the desired normalization calls to perform on the results of the resume parsing operation. The list of possible values is as follows (case-insensitive): company_norm, geocoding, job_level, job_title_carotene, job_title_onet, school_norm, skills. For example, a request with a desired_enrichments value equal to job_level,skills,job_title_onet,company_norm would receive job level classifications, skills extractions, ONet job title classifications, and company normalizations. The API does not currently allow callers to request only certain versions of a classification service.
The value "none" may be supplied to skip all post-parsing classifications and simply return the results of the parse. At present, a request that does not include this parameter will receive all classifications; this is temporary behavior for the sake of backwards compatibility. Once all customers have started using the desired_enrichments parameter, its usage will become required, and requests excluding it will result in a 400 Bad Request response code.
service (Optional) -- The resume parsing service you wish to use. Accepted values are "sovren", "textkernel", and "daxtra". If this parameter is not provided, the service will automatically select the preferred parser for the document's language. Click here for the current list of service defaults by language.
language (Optional) -- ISO 639-1 language code. If this parameter is not provided the service will run language detection on the provided document to get a language code.

Response Structure

{
  "data": {
    "clean_resume_text": string,
    "raw_resume_text": string,
    "resume_html": string,
    "has_managed_others": boolean,
    "formatted_name": string,
    "first_name": string,
    "middle_name": string,
    "last_name": string,
    "affix": string,
    "email_address": string,
    "home_number": string,
    "office_number": string,
    "mobile_number": string,
    "fax_number": string,
    "pager_number": string,
    "country": string,
    "zip": string,
    "city": string,
    "state": string,
    "address_line1": string,
    "is_currently_employed": boolean,
    "most_recent_employer": string,
    "most_recent_job_title": string,
    "experience_months": integer,
    "last_job_months": integer,
    "number_of_jobs": integer,
    "highest_degree_type": string,
    "languages": [
      {
        "language_code": string,
        "comments": string,
        "speak": string,
        "read": string,
        "write": string
      }
    ],
    "resume_education_histories": [
      {
        "school_normalization": {
                    "school_normalization_v1": [
                        {
                            "normalized_school_name": string,
                            "id": string,
                            "country": string,
                            "confidence": double
                        }
                    ]
        },
        "school_name": string,
        "address_type": string,
        "city": string,
        "state": string,
        "country": string,
        "degree_major": string,
        "degree_date": string,
        "degree_type": string,
        "degree_comments": string,
        "attendance_start_date": string,
        "attendance_end_date": string,
        "educational_measure_system": string,
        "measure_system_value": string,
        "measure_system_lowest": string,
        "measure_system_highest": string
      }
    ],
    "employments": [
      {
        "job_titles": {
          "onet15": [
            {
              "title": string,
              "id": string,
              "confidence": integer
            },
            [... more onet15 results]
          ],
          [... results for other taxonomies]
        },
        "job_level": {
          "1.0": {
            "text": string,
            "level": string,
            "description": string
          }
        },
        "skills": {
          "2.0": [
            {
              "skilldid": string,
              "normalized_term": string,
              "confidence": float
            },
            [... more skillsV2 results]
          ],
          "3.0": [
            {
              "skilldid": string,
              "normalized_term": string,
              "confidence": float
            },
            [... more skillsV3 results]
          ]
        },
        "company_normalization": {
          "1.0": {
            "company_depot": {
              "normalized_companies": [
                {
                "confidence": double,
                "normalized_name": string,
                "id": string,
                "naics_code": string,
                "naics_description": string,
                "duns_number": string,
                "website": string,
                "country": string,
                "state": string,
                "postal_code": string,
                "city": string,
                "address": string,
                "company_size": int
                }
              ],
              "master_company": {
                "confidence": double,
                "normalized_name": string,
                "id": string,
                "naics_code": string,
                "naics_description": string,
                "duns_number": string,
                "website": string,
                "country": string,
                "state": string,
                "postal_code": string,
                "city": string,
                "address": string,
                "company_size": int
              },
              "data_version": "string"
            },
            "data_dot_com": {
              "normalized_companies": [
                {
                "confidence": double,
                "normalized_name": string,
                "id": string,
                "naics_code": string,
                "naics_description": string,
                "duns_number": string,
                "website": string,
                "country": string,
                "state": string,
                "postal_code": string,
                "city": string,
                "address": string,
                "company_size": int
                }
              ],
              "data_version": "string"
            }
          }
        },
        "city": string,
        "state": string,
        "country": string,
        "website": string,
        "job_title": string,
        "employer_name": string,
        "start_date": string,
        "end_date": string,
        "description": string,
        "job_type": string,
        "duration": integer,
        "is_current_position": boolean
      }
    ],
    "skills": {
      "2.0": [
        {
          "skilldid": string,
          "normalized_term": string,
          "confidence": float
        },
        [... more skillsV2 results]
      ],
      "3.0": [
        {
          "skilldid": string,
          "normalized_term": string,
          "confidence": float
        },
        [... more skillsV3 results]
      ]
    },
    "job_level": {
      "1.0": {
        "text": string,
        "level": string,
        "description": string
      }
    },
    "job_titles": {
      "onet15": [
        {
          "title": string,
          "id": string,
          "confidence": integer
        },
        [... more onet15 results]
      ],
      [... results for other taxonomies]
    },
    "geography": {
      "1.0": [
        {
          "admin_areas": [
            {
              "long_name": string,
              "short_name": string,
              "name": string,
              "level": integer
            },
            {
              "long_name": string,
              "short_name": string,
              "name": string,
              "level": integer
            }
          ],
          "city": string,
          "country": string,
          "country_code": string,
          "landmark": string,
          "latitude": float,
          "location_type": string,
          "longitude": float,
          "postal_code": string,
          "street_address": string,
          "sublocality": string
        }
      ]
    }
  }
}

#Versioning

The data returned is unversioned. The current version is 1.0. We expect that each of our vendors return the same data for repeated calls, however we have not verified this systematically. We will occasionally update our vendors which may change the output. If we believe this change is significant we will communicate about it. However, customers will not be able to specify vendor versions; we will not be running multiple versions of a parser. The language to parser mapping is unversioned. We will update it as we understand each vendor's capabilities more fully. If you need to stay on the same parser, please specify it explicitly.

Our general versioning strategy is available here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parse and Normalize API

Summary

Request Structure

Response Structure

#Versioning

FilesExpand file tree

ParseAndNormalize.md

Latest commit

History

ParseAndNormalize.md

File metadata and controls

Parse and Normalize API

Summary

Request Structure

Response Structure

#Versioning