Results 1 to 2 of 2

Thread: Excel file with duplicated column names

  1. #1

    Default Excel file with duplicated column names

    I am trying to process a number of Excel files which have multiple duplicates of the same column name. For example, the header of the file looks something like this:

    Name, Phone, Address, Name, Phone, Address

    I am getting this error, as Lavastorm won't allow me to read in the file which has more than one column with same name:

    ERROR: Duplicate field name encountered while calculating metadata. Field "Name" is defined at both column 1, and 4 in worksheet in (0): "Sheet1", in file: /raw_data/File-20150317.xlsx.

    The only way for me to process the files is to rename duplicated columns, but it's a very time consuming task, considering that there are a few hundred files in a directory that I'd like to read in. Any ideas how to get around that?

    P.S. I'm using BRE v 4.5.3.0 Build 365 and upgrade to the latest version is not an option, as it is on the company's server.

  2. #2
    Lavastorm Employee
    Join Date
    Apr 2014
    Location
    London
    Posts
    57

    Default Metadata Options

    Hi,

    I dont have access to a v4.5 system, but v4.6 includes an option in the exception tab on how to treat duplicate field names. If this option isn't in v4.5 then you could use the get metadata and change metadata options as a quicker alternative to renaming the columns. I've attached an example graph which includes examples of these options: -

    Change Metadata Examples.brg

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •