How to Name Your Tableau Data Source to Prevent AI Hallucinations (Tableau Pulse & Copilot)
Many administrators who have introduced AI features like Tableau Pulse or Einstein Copilot into their Tableau environments are struggling with "hallucinations"—situations where the AI fetches the wrong data to answer user questions.
The biggest cause of these AI hallucinations lies in the messy 'Data Pane' hidden behind your beautiful dashboard screens. AI doesn't read the visual charts; it reads the text (field names) in the Data Pane on the left to analyze your data. For a successful implementation of Pulse & Copilot and solid data governance, here is a naming guide you should apply to your data sources.
1. Avoid Abbreviations and Internal Jargon (Use Full Names)
Leaving column names like TXN_AMT or CUST_ID exactly as they came from the database for developer convenience will severely confuse Large Language Models (LLMs). To help the AI accurately map the inherent meaning of each field, you must rename your data source aliases using clear business language (full names) rather than abbreviations or machine language.
- Recommended (Full Name):
Total_Sales,Customer_ID - Not Recommended (Abbreviation/Machine Language):
tmp_sales,TXN_AMT
2. Explicitly Express Field Attributes
Help the AI instantly grasp the nature of the data by explicitly stating at the beginning of the name whether it is a date or a logical (boolean) value indicating true/false.
- Date Fields: Use the
Date_prefix. (e.g.,Date_Order,Date_First_Purchase) - Boolean Fields: Use the
Is_orHas_prefix. Instead of simply naming a fieldActive, naming itIs_Active_Userclearly communicates to the AI that this is a true/false criterion determining whether a user is active.
3. Remove Unnecessary Version Tags and Clutter (Hide Unused Fields)
During data processing, it's common to append version numbers or temporary statuses to field names (e.g., calc_rev_01, temp_sales_backup). If these tags remain, the AI will get confused about which of the many dimensions and measures it should prioritize for its analysis.
- Recommended:
Total_Revenue - Not Recommended:
Revenue_Data_Final_v3
Unused intermediate calculations or unnecessary columns from the original DB should be boldly Hidden. Leave only the cleanest, final fields visible for the AI to utilize in its analysis.
4. Leverage 'Default Properties ➔ Comment' as a Metadata Goldmine
Entering a specific data dictionary definition here, such as "This field represents net sales excluding VAT," becomes the most powerful Knowledge Base. It enables the AI to interpret the data correctly and provide highly accurate answers to your end-users.
A systematic data source naming convention is more than just cleaning up text. In the upcoming era of AI agents, it is the crucial first step to ensuring data integrity and building an analytics environment your organization can truly trust. Start inspecting the field names of your 'Published Data Sources' shared on the server today.