[데이터 아키텍처] DA 구축 예제 | 2. 현행 DB 준비 (feat. 파이썬)

import pandas as pd
import cx_Oracle as co
from datetime import datetime

def main():
    # read config file
    cfg_path = 'C:/Users/pgsus/Documents/Python Scripts/data/전국병의원 및 약국현황 2023.03/'
    cfg_file = '병의원정보.cfg'

    xls_list = []
    f = open(cfg_path + cfg_file, 'r', encoding='UTF8')
    f_lines = f.readlines()
    for fl in f_lines:
        fl = fl.strip()
        xls_list.append(fl)
    f.close()

    # oracle client library 설정 & connect
    co.init_oracle_client(lib_dir=r"C:\Users\pgsus\Downloads\instantclient_19_10")
    con = co.connect('soctt', 'tiger', '192.110.110.110:1521/orcl') # ip 임의 변경 

    # open cursor
    cur = con.cursor()

    # 엑셀 파일을 순차적으로 read 및 DB insert
    for xf in xls_list:
        print('starting ...', xf)
        df = pd.read_excel(cfg_path + xf).fillna('')
        # 엑셀 파일에서 table명 생성
        table_nm = xf.replace('.xlsx','').replace('.','_')
        # column명 추출
        col_lst = df.columns.values.tolist()
        # CREATE TABLE 구문 생성
        ct_stmt = "CREATE TABLE " + table_nm + " ( "
        # insert의 values (:1, :2 .. ) 구문 생성
        ins_values = " VALUES ("
        # table의 컬럼 및 datatype 정의
        for i, col in enumerate(col_lst, start=1):
            # 컬럼명 변환 : 예) 의과일반의 인원수 -> 의과일반의_인원수, 좌표(X) -> 좌표_X
            ct_stmt += col.replace(' ','_').replace(')','').replace('(','_')
            # VALUE (:1, :2 ...) 구문 생성
            ins_values += ':' + str(i)

            if df[col].dtypes == 'object':
                ct_stmt += ' VARCHAR2(500)'
            elif df[col].dtypes == 'int64':
                ct_stmt += ' NUMBER'
            elif df[col].dtypes == 'float64':
                ct_stmt += ' NUMBER'
            else:
                ct_stmt += ' VARCHAR2(200)'
            if col != col_lst[-1]:      # check last element of list
                ct_stmt += ', '
                ins_values += ','
            else:
                ct_stmt += ') '
                ins_values += ') '
            i += 1
        # execute Create Table statement
        try:
            cur.execute(ct_stmt)
            print('Create table ', table_nm, '... completed.')
        except co.DatabaseError as e:
            error, = e.args
            if error.code == 955:
                print('table already exists')
            else:
                print(error.message)
        # insert 구문 생성
        ins_stmt = "INSERT INTO " + table_nm + ins_values
        # convert pandas to list
        rows = df.values.tolist()
        # insert 수행
        for i, row in enumerate(rows):
            cur.execute(ins_stmt, row)
        print('rows inserted: ', i+1)
        # commit
        con.commit()
        print('commit : ', datetime.now().strftime('%Y-%m-%d %H:%M:%S'))
    # close
    cur.close()
    con.close()
    
if __name__ == '__main__':
    s_time = datetime.now()
    print('start time :', datetime.now().strftime('%Y-%m-%d %H:%M:%S'))
    main()
    e_time = datetime.now()
    print('end_time : ', e_time.strftime('%Y-%m-%d %H:%M:%S'))
    elapsed_time = e_time - s_time
    print('elapsed_time(s) : ', elapsed_time.total_seconds())

생성된 테이블의 결과는 아래와 같다.

'데이터 아키텍처' 카테고리의 다른 글

[데이터 아키텍처] DA 구축 예제 \| 5. 현행 개념 데이터 모델링 (0)	2024.05.21
[데이터 아키텍처] DA 구축 예제 \| 4. 현행 논리 데이터 모델링 (1)	2024.05.19
[데이터 아키텍처] DA 구축 예제 \| 3. 리버스 모델링 (0)	2024.05.16
[데이터 아키텍처] DA 구축 절차 (0)	2024.05.15
[데이터 아키텍처] DA 구축 예제 \| 1. 개요 - 오픈 데이터 활용 (0)	2024.05.14

박선생의 블로그

[데이터 아키텍처] DA 구축 예제 | 2. 현행 DB 준비 (feat. 파이썬)

[데이터 아키텍처] DA 구축 예제 | 2. 현행 DB 준비 - 파일 업로드 (feat. 파이썬)

'데이터 아키텍처' 카테고리의 다른 글

티스토리툴바

[데이터 아키텍처] DA 구축 예제 | 2. 현행 DB 준비 (feat. 파이썬)

[데이터 아키텍처] DA 구축 예제 | 2. 현행 DB 준비 - 파일 업로드 (feat. 파이썬)

'데이터 아키텍처' 카테고리의 다른 글

관련글

티스토리툴바