2011-12-22 89 views
2

我目前正在試圖優化我的數據庫。問題是以下幾點: 我有一張表,當前存儲超過83Mio。時間依賴值。它們由高分辨率(ms)時間戳索引。我需要做的是計算特定值出現在給定時間間隔內的次數 - 例如說,我想知道值1.56787在時間戳x到時間戳y的時間間隔中出現了多少次。現在這需要幾乎永遠。 即時通訊使用InnoDB,我已經花了很多時間來優化配置文件,這增加了速度immensly。優化MySQL數據庫的快速計數

我很感激任何輸入,因爲我幾乎不知道如何將它關閉。我能想到的唯一解決方法是創建包含固定時間間隔的預計數值的表,由於整個事物也應該是完全可更新的(我們正在討論每隔幾毫秒到達的新值),所以這並不令人滿意。另一個數據庫系統會更適合我的問題嗎?

下面是解釋輸出:

Field Type Null Key Default Extra 

timestamp bigint(20) NO PRI NULL  
ask decimal(6,5) NO  NULL  
bid decimal(6,5) NO  NULL  
askvolume decimal(6,5) NO  NULL  
bidvolume decimal(6,5) NO  NULL  

# The MySQL server 
[mysqld] 
port= 3306 
socket= "C:/xampp/mysql/mysql.sock" 
basedir="C:/xampp/mysql" 
tmpdir="C:/xampp/tmp" 
datadir="C:/xampp/mysql/data" 
pid_file="mysql.pid" 
skip-external-locking 
key_buffer = 16M 
max_allowed_packet = 61M 
table_cache = 64 
sort_buffer_size = 512K 
net_buffer_length = 8K 
read_buffer_size = 256K 
read_rnd_buffer_size = 512K 
myisam_sort_buffer_size = 8M 
log_error="mysql_error.log" 
bind-address="192.168.1.2" 


# Don't listen on a TCP/IP port at all. This can be a security enhancement, 
# if all processes that need to connect to mysqld run on the same host. 
# All interaction with mysqld must be made via Unix sockets or named pipes. 
# Note that using this option without enabling named pipes on Windows 
# (via the "enable-named-pipe" option) will render mysqld useless! 
# 
# commented in by lampp security 
#skip-networking 
skip-federated 

# Replication Master Server (default) 
# binary logging is required for replication 
# log-bin deactivated by default since XAMPP 1.4.11 
#log-bin=mysql-bin 

# required unique id between 1 and 2^32 - 1 
# defaults to 1 if master-host is not set 
# but will not function as a master if omitted 
server-id = 1 

# Replication Slave (comment out master section to use this) 
# 
# To configure this host as a replication slave, you can choose between 
# two methods : 
# 
# 1) Use the CHANGE MASTER TO command (fully described in our manual) - 
# the syntax is: 
# 
# CHANGE MASTER TO MASTER_HOST=<host>, MASTER_PORT=<port>, 
# MASTER_USER=<user>, MASTER_PASSWORD=<password> ; 
# 
# where you replace <host>, <user>, <password> by quoted strings and 
# <port> by the master's port number (3306 by default). 
# 
# Example: 
# 
# CHANGE MASTER TO MASTER_HOST='125.564.12.1', MASTER_PORT=3306, 
# MASTER_USER='joe', MASTER_PASSWORD='secret'; 
# 
# OR 
# 
# 2) Set the variables below. However, in case you choose this method, then 
# start replication for the first time (even unsuccessfully, for example 
# if you mistyped the password in master-password and the slave fails to 
# connect), the slave will create a master.info file, and any later 
# change in this file to the variables' values below will be ignored and 
# overridden by the content of the master.info file, unless you shutdown 
# the slave server, delete master.info and restart the slaver server. 
# For that reason, you may want to leave the lines below untouched 
# (commented) and instead use CHANGE MASTER TO (see above) 
# 
# required unique id between 2 and 2^32 - 1 
# (and different from the master) 
# defaults to 2 if master-host is set 
# but will not function as a slave if omitted 
#server-id  = 2 
# 
# The replication master for this slave - required 
#master-host  = <hostname> 
# 
# The username the slave will use for authentication when connecting 
# to the master - required 
#master-user  = <username> 
# 
# The password the slave will authenticate with when connecting to 
# the master - required 
#master-password = <password> 
# 
# The port the master is listening on. 
# optional - defaults to 3306 
#master-port  = <port> 
# 
# binary logging - not required for slaves, but recommended 
#log-bin=mysql-bin 


# Point the following paths to different dedicated disks 
#tmpdir = "C:/xampp/tmp" 
#log-update = /path-to-dedicated-directory/hostname 

# Uncomment the following if you are using BDB tables 
#bdb_cache_size = 4M 
#bdb_max_lock = 10000 

# Comment the following if you are using InnoDB tables 
#skip-innodb 
innodb_data_home_dir = "C:/xampp/mysql/data" 
innodb_data_file_path = ibdata1:10M:autoextend 
innodb_log_group_home_dir = "C:/xampp/mysql/data" 
#innodb_log_arch_dir = "C:/xampp/mysql/data" 
## You can set .._buffer_pool_size up to 50 - 80 % 
## of RAM but beware of setting memory usage too high 
innodb_buffer_pool_size = 1024M 
innodb_additional_mem_pool_size = 20M 
## Set .._log_file_size to 25 % of buffer pool size 
innodb_log_file_size = 5M 
innodb_log_buffer_size = 16M 
innodb_flush_log_at_trx_commit = 0 
innodb_lock_wait_timeout = 50 

[mysqldump] 
quick 
max_allowed_packet = 16M 

[mysql] 
no-auto-rehash 
# Remove the next comment character if you are not familiar with SQL 
#safe-updates 

[isamchk] 
key_buffer = 20M 
sort_buffer_size = 20M 
read_buffer = 2M 
write_buffer = 2M 

[myisamchk] 
key_buffer = 20M 
sort_buffer_size = 20M 
read_buffer = 2M 
write_buffer = 2M 

[mysqlhotcopy] 
interactive-timeout 

喔機器是i7-950的RAM 6GB和系統+數據庫上的SSD。所以我認爲這不應該是問題?

感謝您的幫助,我們將非常感謝!

+0

在這個問題不是一種選擇砸錢?你可以做很多事情來提高性能,而更多的錢絕對不是我的第一選擇......但是有些觀點你必須得到更好的硬件。根據你的描述,你似乎越來越接近這一點。這就是說...我強烈建議給(配置良好的)Postgres一槍。 – shesek 2011-12-22 09:05:22

+0

沒有看到用於運行MySQl實例的表格和硬件 - 很難提出任何建議。我懷疑Postgres會在I/O綁定系統上做出什麼改變,尤其是因爲知道InnoDB的B-tree實現是業界最好的實現之一。以遠距離交換整個RDBMS並不是一個可行的選擇。我敢打賭,MySQL配置不正確(SHOW VARIABLES LIKE'%innodb%'和'EXPLAIN ...'的輸出不存在)。如果您可以發佈這些信息,那麼分析出現問題會更容易。 – 2011-12-22 09:17:46

+0

感謝球員們,我把所有的信息都加入到了描述中......如果有必要,它不應該成爲一個問題,但是當我貼出來的時候,它是一個帶有ssd和6GB內存的i7 ... – user871784 2011-12-22 11:45:31

回答

-1

第一步:如果您尚未完成此操作,請使用explain plan查看查詢的瓶頸究竟是什麼,以及引擎是否正確使用索引。

第二步:按時間戳範圍對錶進行分區。我不確定MySQL/InnoDB是否具備這種能力,但如果沒有,您最好更換DBMS。

在任何情況下,MySQL都不是高性能的好選擇:根據您的需求,使用Oracle或Postgre或者甚至內存存儲可能會更好(尤其是如果您不太在意的話)爲了安全而不是表現)。

+0

InnoDB在內存中存儲工作數據集,其工作速度比內存中特定的解決方案快得多。 Postgres或Oracle不會更快。這是一個優化系統I/O並查看查詢花費多長時間的問題。 – 2011-12-22 09:19:08

+0

我真誠地懷疑InnoDB比H2更快(他們說它不是,它的價值:http://www.h2database.com/html/performance.html)。儘管如此,清晰地轉換RDBMS是需要考慮的另一件事,而不是最簡單的解決方案,但值得一提的是。 – Viruzzo 2011-12-22 11:07:15

+0

InnoDB @ 5.6速度很快,並且您鏈接的基準測試表明,MySQL在他們的情況下每秒執行140條語句,這比我的小型測試xeon和7200rpm驅動器的速度要慢。我不相信其他系統的開箱即用基準。再次,有專門針對MySQL的基於列的分析引擎,其性能比Oracle快。問題是爲什麼數據庫的數據少於1億個。記錄執行速度如此之慢,我知道一個事實,即如果不更改數據庫系統,可以在InnoDB上獲得非常快的結果。 – 2011-12-22 11:29:58

0

我沒有感覺你在你的索引時間戳值的值範圍,但在我看來,partitioning你的表可以幫助你在這裏。具體爲RANGE partitioningHASH partitioning

這應該會顯着提升性能。

+0

哈希分區是不可取的:它確保數據在分區中的均勻分佈(作爲散列),但對於間隔查詢(如他所做的那樣),它不提供特別的好處,而範圍分區最明顯。 – Viruzzo 2011-12-22 11:01:22

+0

謝謝大家的建議,我想我現在有很多東西要挖掘! – user871784 2011-12-22 11:30:03