- Print
- PDF
The ETL (Extract, Transform, Load) process in ACE streamlines data imports by enabling users to upload, transform, and integrate large datasets. ETL jobs rely on configurations, profiles, and proper file formats to ensure accurate data loads. This guide explains common ETL challenges, provides troubleshooting steps, and highlights essential tools such as ETL Import History, Delta Logs, and Multithreading.
Common ETL Issues
File Formatting Errors: The imported file does not meet ACE’s layout requirements or contains invalid data.
Transformation Failures: Mismatched or missing field mappings in ETL profiles lead to incomplete imports.
Load Errors: Duplicate records, schema mismatches, or bundling issues disrupt the data load process.
Performance Bottlenecks: ETL jobs take too long to complete or fail to finish due to large file sizes or system overload.
Corrupted Files: Files fail to upload or load properly due to file corruption.
Step-by-Step Troubleshooting
1. Locating ETL Import History
To troubleshoot a failed ETL job, start by reviewing its Import History:
Navigate to Transactions → ETL Import → History.
On the Imports list screen, locate the file you want to analyze.
Click the file name to open the Delta Log page, which contains:
Delta Status: Completion status (e.g., COMPLETED, FAILED).
Start Time, End Time, and Duration: Identify timing issues.
Error Count: Review the number of errors logged.
Duplicate Count: Track duplicates that may disrupt imports.
Use the Download Messages button to export detailed logs, including warnings, errors, and impacted rows.
2. Resolving File Formatting Issues
Symptoms: Errors like "File format invalid" or "Missing required fields."
Steps:
Verify file layout against ACE’s Import Layouts documentation:
File must be in a supported format (e.g., CSV, TSV, or PSV).
Include mandatory fields like account numbers, dates, or client IDs.
Check for common formatting errors:
Invalid characters (e.g., special symbols).
Incorrect column headers or missing columns.
Correct the file and reload the data.
3. Fixing Transformation Failures
Symptoms: Errors such as "Field mapping failed" or incorrect data placement.
Steps:
Navigate to the ETL Profile used for the job under Accounts → ETL Import → Profiles.
Review and correct field mappings:
Open the Data Map section to map fields from the file to ACE fields.
Confirm correct mappings for required fields (e.g., client codes, demographic data).
Apply changes and re-run the ETL job.
For custom columns:
If prompted to update columns, confirm changes and map new fields in the Data Map section. See Handling Custom Columns in ETL Profile Updates for details.
4. Resolving Load Errors
Symptoms: Errors such as "Duplicate key violation" or "Schema mismatch."
Steps:
Review the Delta Log to identify problematic rows:
Use Download Messages to locate duplicates or mismatched data.
Focus on fields like Debt ID or Client Code.
Deduplicate the file:
Use Excel, SQL queries, or data-cleaning tools to remove duplicate records.
Check schema alignment:
Ensure source fields match ACE’s field types, lengths, and formats.
Disable problematic features:
Temporarily disable Bundling or SOL (Statute of Limitations) for legacy data imports.
5. Optimizing Performance
Symptoms: ETL jobs take too long or fail mid-process.
Steps:
Enable Multithreading:
Go to Setup → System and set the Maximum ETL Threads (recommended: 10–30).
Enable Multithreading in the ETL profile for faster imports. See ETL Multithreading for details.
Split large files:
Divide data into smaller batches for faster processing.
Schedule during off-peak hours:
Run ETL jobs when system activity is low.
Monitor system resources:
Check CPU and memory usage during the job. Adjust resources as needed. See ACE System Metrics: Tracking CPU and Memory Usage for more information.
Fixing Corrupted Files
Symptoms: Errors such as "File could not be read" or "Unexpected file termination."
Steps:
Detect Corruption:
Check if the file opens properly in a text editor or spreadsheet application.
Look for signs of corruption, such as:
Garbled text or unreadable characters.
Unexpected file size (too large or too small).
Missing data or rows that appear incomplete.
Resolve Corruption:
Re-export the file: If possible, regenerate the file from the original data source.
Clean the file:
Open the file in a spreadsheet tool (e.g., Excel).
Remove invalid characters or problematic rows.
Save the file as a new CSV or TSV file.
Test the File:
Upload a smaller, clean version of the file to verify it loads correctly.
If successful, upload the full file.
Key Tools and Features
Delta Log Details
The Delta Log provides a comprehensive summary of ETL jobs:
General Information:
Delta Status: COMPLETED, FAILED, or ROLLED BACK.
Error Count: Total errors during the job.
Records Loaded by Type: Breakdown of records imported (e.g., debts, transactions, demographics).
Rollback:
If errors are found, use the Rollback button to undo the data load.
Note: Accounts with existing transaction history cannot be rolled back.
Multithreading
Enables parallel processing of large files, reducing load times.
Best for large data imports like client lists, transactions, and account flags.
Caution: Do not use multithreading if the order of rows in the file matters.
ETL Profiles
Reusable configurations for consistent imports of similarly formatted files.
Steps to create an ETL profile:
Navigate to Accounts → ETL Import → Profiles → New.
Upload a sample file to map fields.
Save the profile for future imports.
Preventative Measures
Standardize File Formats:
Use ACE-approved layouts and templates to reduce formatting errors. See our Import Layout Category for details.
Audit ETL Profiles Regularly:
Verify that field mappings are accurate and up to date.
Schedule Imports Strategically:
Run large imports during off-peak hours to optimize performance.
Enable Notifications:
Use the On Errors Notify User ID feature to alert administrators of failed jobs.
Example Scenarios
Scenario 1: Duplicate Key Violation
Cause: The source file contains duplicate Debt IDs.
Solution:
Open the Delta Log and download error messages.
Deduplicate the file using Excel or SQL.
Reload the data.
Scenario 2: File Fails to Load Due to Schema Mismatch
Cause: Field types in the file (e.g., text instead of numeric) don’t match ACE requirements.
Solution:
Correct the mismatched fields in the file.
Reload the job and verify the data mapping.
Reporting Issues to Support
If you encounter errors or unexpected behavior on any page in the ACE system, follow the steps below to ensure that our support team has the information needed to assist you effectively.
Steps to Report an Issue
Take a Screenshot:
Capture the entire page, including:
The details visible on the page (e.g., fields, statuses, or tasks).
Any error messages displayed.
The URL in your browser’s address bar.
Include Key Details:
In addition to the screenshot, provide the following information:
What you were doing: A brief explanation of the actions you were taking when the issue occurred.
Steps to reproduce: A clear description of how the issue can be replicated (if applicable).
Error messages: Copy or include the exact error message text, if any.
Contact Support:
Submit the screenshot and all relevant details to support.