On the Connection Page of the Create Entity wizard, select the Loader Connection given to you by the administrator that you have checked in Step 1. Leave the default settings for the filter and the connection and click Next. Oracle Data Quality connects to the data source using the Loader Connection you selected in Step 4. If the connection fails, contact your Metabase Administrator. By using Panoply to sync, store, and access your Oracle data, you can eliminate data management headaches. Panoply’s 3rd-party Oracle connector makes it possible to push data from Oracle into a secure data warehouse in minutes and without complicated code. Data in Panoply is always query-ready, making it easy for analysts to combine Oracle data with other data sources for faster and more. Metabase has the ability to connect to some databases by first establishing a connection to a server in between Metabase and a data warehouse, then connect to the data warehouse using that connection as a bridge. This makes connecting to some data warehouses possible in situations that would otherwise prevent the use of Metabase. With Panoply, you can eliminate that hassle—and all that wasted time. Set up code-free connections to both PostgreSQL and Metabase with just a few clicks. Then Panoply will automatically sync your data so it’s ready to analyze when you are. It’s that simple: Unblock your data with Panoply.
This chapter describes how to install and configure Oracle Data Integrator. Post-installation configuration parameters are also provided.
The following topics are covered:
2.1 Preparing to Install
Review the information in this section before you begin:
2.1.1 Review System Requirements and Certification
Before installing any Oracle Data Quality products, you should read the system requirements and certification documentation to ensure that your environment meets the minimum installation requirements. Both of these documents are available on Oracle Technology Network (OTN).
The system requirements document covers information such as hardware and software requirements, minimum disk space and memory requirements, and required system libraries, packages, or patches:
The certification document covers supported installation types, platforms, operating systems, databases, and third-party products:
2.1.2Define User Accounts
Oracle Data Quality products rely on multiple users to perform certain procedures (as defined in Table 2-1). You create the Oracle Data Quality users during and after the installation procedure. You define the UNIX operating system users before you install Oracle Data Profiling and Oracle Data Quality for Oracle Data Integrator.
Table 2-1 Required User Accounts
User | Description |
---|---|
Or:
| Some procedures must be performed by an operating system user with root or super user access. When you enter a root password during the software installation, See your operating system documentation for more information on these user types. |
Oracle Data Quality Application Administrator | An operating system user who installs the Oracle Data Quality Server application and administers the Oracle Data Quality Scheduler and License Manager. (This user is required when installing on either Windows operating systems or UNIX operating systems.) For information on creating this user, see Section 2.1.2.1, 'Define an Oracle Data Quality Application Administrator'. |
Oracle Data Quality Loader User | User who will access data import directories (located on the Oracle Data Quality server) through a login screen in the Oracle Data Quality User Interface. This user is not required if you plan to directly access data from relational sources (Oracle, IBM DB2, ODBC). You will, however, need a user id that gives you access to each specific database. NOTE: This user is only required if you plan to load data from flat file sources (delimited, COBOL, Oracle Data Quality sources). For more information on creating this user, see Section 2.1.2.2, 'Define Oracle Data Quality Loader Users'. |
Metabase Administrator | Oracle Data Quality user account that creates and maintains Oracle Data Quality repositories, and defines metabases, Oracle Data Quality users, and data connections. This user is also known as the Oracle Data Quality Repository User. NOTE: The metabase administrator is created during Oracle Data Quality installation. |
Oracle Data Quality User | Oracle Data Quality user account that accesses Oracle Data Quality metabases through the Oracle Data Quality User Interface NOTE: Oracle Data Quality users are created by the metabase administrator after installation. |
2.1.2.1 Define an Oracle Data Quality Application Administrator
You must create an Oracle Data Quality Application Administrator and, if you are using sudo
, you must also grant sudo
rights before installing Oracle Date Quality products. This administrator installs the server application and administers the Scheduler and License Manager on the Oracle Data Quality server.
To create an Application Administrator on UNIX Operating Systems:
As the UNIX root user, create an Oracle Data Quality administrator user account to perform Oracle Data Quality administrative activities.
There are no naming restrictions for the Oracle Data Quality administrator name, but the recommendation is that the user account be named odqadmin.
Do one of the following:
If you are not leveraging the security provided through
sudo
, make sure that the Oracle Data Quality administrator has read access to any data import directories that you define. Proceed to 'Define Oracle Data Quality Loader Users'.If you are leveraging the security features of
sudo
, proceed to the next step.
As the UNIX root user, type the command:
visudo
. This brings up the file namedsudoers
for editing.NOTE: You must always edit this file through the
visudo
command.In the Defaults specification section, add the following entries:
In the User Privilege specification section, add:
Save the file and exit.
Verify that
sudo
is correctly configured.Log in as the newly created Oracle Data Quality administrator. For example, type:
When prompted, enter the password for the user.
Type the command:
id
The operating system should return the UNIX user id. If not, contact your system administrator.
To create an Application Administrator on Windows Operating Systems:
As the Windows Server Administrator, create a Windows user account for the Oracle Data Quality Administrator.
There are no naming restrictions for the Oracle Data Quality administrator name, but the recommendation is that the user account be named odqadmin.
Add the Oracle Data Quality Administrator user odqadmin to the group Administrators.
From the Windows Control Panel, open Administrative Tools, Local Security Policy and expand the Security Settings > Local Policies folder.
Add the Oracle Data Quality Application Administrator user odqadmin to the following options:
Log on as a Service
Act as part of the operating system
2.1.2.2 Define Oracle Data Quality Loader Users
Define an Oracle Data Quality Loader User account on the UNIX server if you plan to import data from flat files. When you import flat file data into Oracle Data Quality, you access the file location on the server through a UNIX user id.
For example, if you are importing flat files described by COBOL copybooks that are residing on the Oracle Data Quality server in the directory /data
, then you require an Oracle Data Quality Loader User (UNIX user id) who can log on to the Oracle Data Quality server and read the files from /data
.
Note:
If you intend to import data directly from a relational source, you do not need to perform these steps since a UNIX userid is not required. The only user required is for RDBMS login accessDefine a new user account or select an existing user account to act as an Oracle Data Quality Loader User.
NOTE: For Windows operating systems, determine whether you will create a single user account that the team will share or if each user will have their own user account.
Give the account read access to the data import directory that you plan to use when you create a Loader Connection.
NOTE: For Windows operating systems, add each user account (that will access the flat file data) to the appropriate Windows user group for each secured location. User accounts not contained in the Windows group will not be able to import flat file data for that loader connection.
2.1.3 Identify Available Ports
During installation you must provide at least 2 port numbers that are accessible from the client to the server and are not blocked. These port numbers are used for the following services:
The Oracle Data Quality Scheduler requires a port called Scheduler Port.
The Oracle Data Profiling and Quality Metabase Repository requires a port called Repository Port.
Note:
If you are planning to use ODBC datasources from a Windows-based server, you will need to reserve a third port for the Oracle Data Quality ODBC Adapter.To identify ports that have applications listening on them, type the netstat -an
command. Select two available ports and make note of them for the setup procedure.
Note:
The port numbers should be greater than 1000 and not exceed 65535, and for easy reference, should be consecutive numbers. (For example,7600
for the Repository Port and 7601
for the Scheduler Port.)2.2 Installing Oracle Data Quality Products
This section contains information and instructions for installing Oracle Data Integrator:
2.2.1Starting the Installer
To start the installer, insert the Oracle Data Quality installation disk or navigate to /Disk1
in the ODQ installation directory (where you saved the ODQ .zip or .jar file) and run the following command:
On UNIX operating systems:
On Windows operating systems:
Refer to the Oracle Fusion Middleware Certification document to see the JDKs supported for your system: http://www.oracle.com/technology/software/products/ias/files/fusion_certification.html
2.2.2Installation Log Files
The installer writes logs files to the Oracle_Inventory_Location
/log
(on UNIX operating systems) or Oracle_Inventory_Location
logs
(on Windows operating systems) directory. On UNIX systems, if you do not know the location of your Oracle Inventory directory, you can find it in the oraInst.loc
file in the following directories (default locations):
Linux operating systems:
/etc/oraInst.loc
HP-UX and Solaris operating systems:
/var/opt/oracle/oraInst.loc
Windows operating systems:
Program FilesOracleInventorylogs
2.2.3Installing on UNIX Operating Systems
The Oracle Data Profiling and Quality installation for UNIX operating systems includes the following components:
Oracle Data Quality for Data Integrator
Metabase Server
Metabase Definitions
Table 2-2 describes the screens included in a UNIX operating system installation. For more information, see Appendix B, 'Oracle Data Profiling and Data Quality Installation Screens'.
Table 2-2 UNIX Operating System Installation Flow
No. | Screen | When Does This Screen Appear? | Description and Action Required |
---|---|---|---|
1 | Always | Click Next to continue. | |
2 | Always | This screen analyzes the host computer to ensure that specific operating system prerequisites have been met. If any of the prerequisite checks fail, then a short error message appears in the bottom portion of the screen. Fix the error and click Retry to try again. If you want to ignore the error or warning messages and continue with the installation, click Continue. Click Abort to stop prerequisite checking for all components. | |
3 | Always | In the Location field, enter the Oracle home (referred to in this guide as Click Next to continue. | |
4 | Always | This screen configures the Metabase Server. Provide the required information and click Next to continue. | |
5 | Always | Review the summary and click Install to continue. | |
6 | Always | The installer automatically executes each configuration assistant in sequence, displaying the progress in the Status column. No action is required on this screen. | |
7 | Always | If you want to save this configuration to a text file, click Save. This file can be used later if you choose to perform the same installation from the command line. Click Finish to close the installer. |
2.2.4Installing on Windows Operating Systems
Table 2-3 describes the screens included in a Windows operating system installation. For more information, see Appendix B, 'Oracle Data Profiling and Data Quality Installation Screens'.
Table 2-3 Windows Operating System Installation Flow
No. | Screen | When Does This Screen Appear? | Description and Action Required |
---|---|---|---|
1 | Always | Click Next to continue. | |
2 | Always | Select the components you want to install. The options are:
Click Next to continue. | |
3 | Always | Click Next to continue. | |
4 | Always | Specify the absolute path to your Oracle home (referred to in this guide as Click Next to continue. | |
5 | Always | This screen configures the Metabase Server. Provide the required information and click Next to continue. | |
6 | Only if you selected Metabase Client on the Select Components Screen (Windows Operating Systems Only) screen. | This screen configures the client to connect to the Metabase and ODBC Servers. Provide the required information and click Next to continue. | |
7 | Always | Review the summary and click Install to continue. | |
8 | Always | The installer automatically executes each configuration assistant in sequence, displaying the progress in the Status column. No action is required on this screen. | |
9 | If you want to save this configuration to a text file, click Save. This file can be used later if you choose to perform the same installation from the command line. Click Finish to close the installer. |
2.3 Installing Additional Postal Tables
The postal tables are a critical part of the data quality process because they provide the postal information that is used to validate and improve the name and address data in your records. This section describes how to install postal and census directories for for an Oracle Data Quality Server.
2.3.1 Postal Table Naming Conventions
The postal tables and census/DPV directories are delivered in a compressed format. The file extensions are .zip (for Windows operating systems) and .tar (for UNIX operating systems).
Table 2-4 describe the file naming conventions that are used:
Table 2-4 Naming Conventions
Table or Directory Name | Naming Convention Used |
---|---|
General and Asian Postal Tables |
For example:
|
Global Postal Tables NOTE: Global Postal Tables are a subset of international postal tables that are invoked from within Oracle Data Quality. Global Postal Tables are distinct because they require an additional service. The postal tables in this subset are: Austria, Brazil, Czech Republic, Denmark, Finland, Greece, Hungary, Ireland, Mexico, New Zealand, Norway, Poland, and Sweden. |
For example:
|
Census Directories NOTE: Census data is available only for the United States. |
|
DPV Directory |
|
LACS DirectoryLink | USLMMMq.ext. This is the name of the United States LACSLink directory, and ext is either zip or tar. |
SuiteLinkDirectory |
NOTE:
|
2.3.2 Postal Code Installation Locations
By default, postal-related files and directories are installed to the following locations:
General Postal Tables (including DPV, LACSLink and SuiteLink tables)
On UNIX operating systems:
ODQ_HOME/oracledq/12/tables/postal_tables
On Windows operating systems:
ODQ_HOMEoracledqtables*
Global Postal Tables and Latitude/Longitude Tables
On UNIX operating systems:
ODQ_HOME/oracledq/12/tables/postal_tables
On Windows operating systems:
ODQ_HOMEoracledqtables*
Asian Postal Tables (China, Japan, Korea, Taiwan)
On UNIX operating systems:
ODQ_HOME/oracledq/12/tables/postal_tables
On Windows operating systems:
ODQ_HOMEoracledqtables*
Census
On UNIX operating systems:
ODQ_HOME/oracledq/12/tables/postal_tables
On Windows operating systems:
ODQ_HOMEoracledqtables*
Metabase Oracle Connection Control
If you want to use a different location for the postal tables, follow the procedures Section 2.3.3, 'Alternative Postal Code Installation Locations'.
2.3.3 Alternative Postal Code Installation Locations
Oracle recommends that you install the postal tables in the default directories (as defined in Section 2.3.2). If you prefer to use an alternative directory, you must define the alternative locations before installing the postal tables.
To specify an alternative location for the postal tables:
Shut down Oracle Data Quality products, including all Oracle Data Quality clients.
Close the Oracle Data Quality Scheduler service.
Start the Metabase Server application.
On Windows operating systems:
Programs > Oracle Data Quality Software > Metabase
On UNIX operating systems:
./mtb_admin
When prompted, log on as the metabase administrator (
madmin
).Use the
_control
metabase name to apply the alternative location to all metabases.Ensure that changes have not already been made to the postal directory installation locations by typing the following commands, individually.
The commands should return the following failure message: “No setting with name nnn_directory found in table default_settings.” For example, “No setting with name '
census_directory
' found in table 'default settings
'.”NOTE: If a value is returned, you need to remove the current setting before defining a new one. See Removing a Postal Directory Definition for more information.
Create the alternative installation directory for non-Asian postal tables, census tables, global postal tables, and/or Asian postal tables.
Move any installed
xxCITY
files from the default installation directory to the new, alternative directory.These files were copied to the default postal directories when TS Quality project templates were installed. They must be in the same location as the postal tables.
Remove the default directories (or leave them empty).
At the
mtb_admin
prompt, type:Define the alternative locations by issuing one or more of the following commands:
For non-Asian postal tables:
For Asian postal tables:
For census tables:
For global postal tables:
where
d:newpostal
is the path of the alternative location.
Type
exit
to close the command prompt window.For global postal tables and latitude/longitude tables, there is an additional step.
With a text editor, open the global postal table gaserver.ini file, which is located in the
ODQ_HOME/oracledq/12/Software/bin
directory).Set the CountryDataDirectory entry to point to the alternative location of the global postal table datastore
rdata
directory.Set the KnowledgeBaseDirectory entry to point to the location of the global postal table datastore
kbase
directory.Locate the LicenseDirectory entry and point it to the global postal table license directory. Figure 2-1 shows an example of a modified
gaserver.ini
file.Save and close the file.
Copy the file you modified and paste it in the
ODQ_HOME/oracledq/12/Software/bin/latlong
directory, overwriting the existing file.NOTE: Both versions of the
gaserver.ini
file must point to the same locations.
Restart Oracle Data Quality and the Scheduler.
2.3.4Removing a Postal Directory Definition
To remove a postal directory definition:
Shut down Oracle Data Quality products, including all Oracle Data Quality clients.
Close the Oracle Data Quality Scheduler service.
Start the Metabase Server application.
On UNIX operating systems:
./mtb_admin
On Windows operating systems:
Programs > Oracle Data Quality Software > Metabase Server > Administrator Command Prompt from the Start menu.
When prompted, log on as the metabase administrator (madmin) to the
_control
metabase.Issue the appropriate command:
2.3.5Postal Code Installation Procedures
The basic installation procedure is the same for all postal tables categories. However, there is an extra step involved in setting up a Global Postal Table.
To install postal directory files:
Download the Postal Table file and copy it to the postal install directory. This file is located on the installation CD/DVD or was installed on your machine from an FTP session. See Postal Table Naming Conventions for a description of the file name formats.
Decompress the .tar or .zip file using the appropriate command (WinZip or tar). The expanded files are placed into the current directory.
NOTE: Oracle Data Quality Software compresses some postal tables because of their size. Review the list of files for any that have a
.Z
extension, which indicates compression. If necessary, uncompress files by entering the following command:If you purchased one of the Global Postal Tables, complete the procedure “To set up the Global postal service in Windows” or “To set up a Global postal service in UNIX.” (The following countries use a Global Postal Table: Austria, Brazil, Czech Republic, Denmark, Finland, Greece, Hungary, Ireland, Mexico, New Zealand, Norway, Poland, and Sweden.)
2.3.6 Setting Up a Global Postal Service on UNIX Operating Systems
Use the following steps to set up a global postal service on UNIX operating systems.
Complete the installation of the Global Postal Tables as described in Section 2.3.5.
Change to the directory that contains the file gactl.
For AIX operating systems, this file can be found in the directory:
ODQ_HOME/oracledq/12/Software/GA_server
For all other UNIX operating systems, this file can be found in the directory:
ODQ_HOME/oracledq/12/Software/bin
Enter the following command:
NOTE: If you need to install a new Global Postal Table, you must stop the service, install the new table, then restart the service.
To stop the service, issue the following command:
2.3.7 Setting up a Global Postal Service on Windows Operating Systems
To set up the Global postal service in Windows operating systems:
Complete the installation of the Global Postal Tables.
From the Start menu, select Programs > Oracle Data Quality products > Global Postal Matcher > Create Service.
(Optional) If you have defined a network drive as the alternate location for the GA postal tables, as described in Section 2.3.3, you must modify the Global Address Service properties.
Go to the Windows Services page (Start > Settings > Control Panel > Administrative Tools > Services).
Locate the Global Address Server entry and right-click.
Select Properties and then click the Log On tab.
Select the This Account option and enter your domain name and user name in the first field (for example,
domain_namejsmith)
.In the Password and Confirm Password fields, enter the password associated with your user id and click OK.
From the Start menu, select Programs > Oracle Data Quality products > Global Postal Matcher > Start Service.
From the Start menu, select Settings > Control Panel > Administrative Tools > Services and confirm that the Global service is running.
NOTE: If you need to install a new Global Postal Table, you must stop the service, install the new table, then restart the service.
2.4 Post-Installation Configuration Tasks
Depending on your installation type, you may need to manually configure some of the Oracle Data Quality components. Review the following sections to determine if additional configuration tasks are required:
2.4.1 Start the Oracle Data Quality Scheduler
If you installed the Oracle Data Profiling and Quality server, you must start the Oracle Data Quality Scheduler before you begin. The Oracle Universal Installer will start the Scheduler as part of the installation process, but you may need to start it again.
To start the Oracle Data Quality Scheduler:
Make sure that you are logged on to the Oracle Data Profiling and Quality server as the Oracle Data Quality server administrator as defined on the Metabase Server Details screen during the installation.
Navigate to the following directory:
Type the command:
For more information on using the Scheduler, see the online help for Metabase Administrators.
2.4.2 Configure Environment Variables for UNIX Operating Systems
On UNIX platforms add the following environment variables for the user who has installed Oracle Data Integrator:
Oracle_QUALITY=ODQ_HOME/oracledq/12/Software
LD_LIBRARY_PATH=ODQ_HOME/oracledq/12/Software/bin
2.4.3 Configure inetd.conf File
Oracle Data Quality products work with inetd
, a daemon process that handles network services operating on a UNIX operating system. Upon execution, inetd
reads its configuration information from a configuration file which, by default, is /etc/inetd.conf
.
If you installed the Metabase Server, the Oracle Universal Installer automatically updated the inetd.conf
file. For Linux, HP, and AIX systems, no further action is required.
If you do update the inetd.conf
file for any reason, be sure to recycle it.
On Solaris 10 and later systems, inetd
reads configuration information from a different location. If you have installed Oracle Data Profiling and Quality components on a Solaris 10 system, log on as the root user and issue the following command at the command prompt:
This command converts the data quality entry in the inetd.conf
file to the format required by Solaris 10.
This chapter describes additional tasks that may be required after you have performed the upgrade.
8.1 Update Metabase Connections
The following commands allow you to update your loader connections after migrating the repository:
updateconnection
- updates the details of a specific connectionupdateallconnections
- updates the details of all migrated connections
Oracle recommends using the updateconnection
command to update your migrated loader connections.
Metabase Oracle Connection Control
To update the metabase connections:
Start the Metabase Server Administrator Prompt
On UNIX operating systems:
On Windows operating systems:
Select Programs > Oracle > Oracle Data Profiling and Quality > Metabase Server >Administrator Command Prompt from the Start menu.
When prompted, supply your username, password, and metabase name.
Use the
_control
metabase name.Enter one of the following commands:
To update a specific migrated connection in all the metabases:
To update all migrated connections in all the metabases.
Respond to the system prompts by typing a new value and pressing Enter. To keep the current value, press Enter at the prompt.
After you respond to all the prompts, the command locates the specified connection in all the metabases and updates them with the new information.
Example of a connection update:
Start the Metabase Server Administrator and log on to the
_control
metabase:At the
_control
prompt, enter theupdateconnection
command and press Enter:The following prompt appears:
Press Enter to use the same description, or type a new description and then press Enter.
The following prompt appears:
Press Enter to use the same path, or type a new path and then press Enter. The following prompt appears:
Press Enter to use the same path, or type a new path and then press Enter. The following message appears:
Respond to the prompts for your username and password. Progress messages appear as the command updates the connections named
myconnection
in each metabase. For example:
8.2 Validate the Upgrade Operation
Start Metabase Manager and verify that your metabases and loader connections are available in the Control Admin pane.