Skip to content

add details about data type mismatch #4031

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Jan 19, 2023
Merged

Conversation

muhammadatifjav
Copy link
Contributor

Proposed changes

Warehouse data type mismatch is a common issue that the customers face. We have macros/internal docs to explain the issue, and offer possible resolution, but it can be helpful if we can explain why this happens and quicker to resolve if the customer uses the template when reaching out to resolve this data type mismatch.

The changes proposed include:

  • Structuring "New Columns", "Supported Data Types", and "Column Sizing" under "Schema Evolution and Compatibility" heading
  • Adding content to explain how data type mismatches occurs, steps to resolve and considerations to take in account, and also give a format to the customer to use when reaching out with requests

Merge timing

Related issues (optional)

@muhammadatifjav muhammadatifjav marked this pull request as ready for review January 12, 2023 22:24

In cases where a data type is determined incorrectly, the support team can help you update the data type. As an example, if a field can include float values as well as integers, but the first value we received was an integer, we will set the data type of the field to integer, resulting in a loss of precision.

To update the data type, the support team will update the internal schema that Segment uses to infer your warehouse schema. We will start syncing the data with the correct data type after the change is made. However, if you want to backfill all historical data correctly, it will be required to drop the impacted tables on your end so Segment can recreate them in the correct datatype, and then backfill those tables.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @muhammadatifjav, I had a quick question about this part - after a customer drops the impacted tables, would Segment automatically recreate the tables and backfill them, or would a customer need to reach out to facilitate the backfill?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After customer drops the impacted table, Segment will automatically recreate the table and start syncing new data automatically. However, if they want historical data to be corrected, then they will need to drop the table, and then confirm it to us and we will then run the backfill. I've made another commit to make it more clearer, lmk if that is better

@forstisabella forstisabella merged commit 2606cad into develop Jan 19, 2023
@forstisabella forstisabella deleted the warehouse-schema-datatype branch January 19, 2023 17:26
@github-actions
Copy link
Contributor

Thank you for your contribution! Your pull request is merged, but may take a day or two to appear on the site.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants