Commit Graph

1595 Commits

Author SHA1 Message Date
Kelly McLaughlin
274fc6648b Merge branch 'er/verify_dt_upgrade' 2014-04-23 06:57:32 -06:00
Kelly McLaughlin
f045c2e093 Call verify_counters during the upgrade 2014-04-23 06:55:06 -06:00
Kelly McLaughlin
48972b58a6 Merge branch 'bugfix/rt-rpc-fixes' 2014-04-22 07:13:17 -06:00
Kelly McLaughlin
5fd64a98e8 s/badprc/badrpc 2014-04-22 06:46:23 -06:00
Kelly McLaughlin
9d2c2c3997 Fix refactoring bug in wait_until_all_members function 2014-04-22 06:38:36 -06:00
Kelly McLaughlin
20c0b76c3f Change rpc handling in rt module to handle badrpc returns
Change how the return from some rpc calls is handled in the rt module
in order to avoid spurious or misleading test failures.
2014-04-22 06:38:36 -06:00
Joe DeVivo
3455d26c71 Merge pull request #580 from basho/bugfix/jd/rtssh-set-backend
./riak_test -b backend now works with rtcloud
2014-04-20 11:50:59 -07:00
Christopher Meiklejohn
764f394251 Merge pull request #582 from basho/bugfix/replication2-nval1-request-failure
Wait for transfers to complete in replication2_pg:test_pg_proxy
2014-04-20 17:50:38 +01:00
Kelly McLaughlin
823dd2fe9c Wait for transfers to complete in replication2_pg:test_pg_proxy
Wait for transfers to complete in
replication2_pg:test_pg_proxy. Replication tests that test the n_val=1
request option can fail with insufficient_vnodes errors if the cluster
setup does not include waiting for transfers to complete. Change the
test_pg_proxy test case to wait until transfers complete on the "A"
and "B" clusters before proceeding.
2014-04-17 11:05:41 -06:00
Joe DeVivo
7a48a7be20 ./riak_test -b backend now works with rtcloud 2014-04-16 16:33:52 -07:00
Andrew Thompson
9998807bc8 Merge pull request #560 from basho/adt-add-logging
Add a log/test.log that mirrors the console output
2014-04-16 17:32:24 -04:00
Andrew Thompson
a68559adc2 Merge pull request #574 from basho/adt-fix-cover
Shutdown cover when setup_harness is run
2014-04-16 17:32:11 -04:00
John Daily
aa91baf273 Merge pull request #549 from basho/jrd-bucket-types-naming
Update tests for UTF-8 bucket types (and buckets)
2014-04-16 11:59:59 -04:00
Jared Morrow
76cc312f31 Change previous riak release to 1.4.8 rather than 1.4.2 2014-04-16 07:29:55 -06:00
Kelly McLaughlin
8e5705595a Merge branch 'bugfix/consistent-object-filter-test-fix' 2014-04-15 15:40:31 -06:00
Kelly McLaughlin
5b66c7a354 Add n_val to bucket type specificaitons 2014-04-15 15:08:23 -06:00
Kelly McLaughlin
75f5165078 Fix bug with assigment of LeaderB 2014-04-15 15:08:04 -06:00
Andrew J. Stone
8246b1a090 Force update ring in ensemble_util:build_cluster()
Once riak_ensemble_manager:enable() is called, we need to call
riak_core_ring_manager:force_update() so that the members will be
created and added to the ensembles trying to get a quorum. During ticks
in core, new members are created only if the ring has changed. There is
a race that can sometimes prevent the members from starting and thus the
quorum from being achieved indefinitely. This small change to the test
infrastructure resolves this issue, but it still requires fixing in
riak_core and/or riak_kv.
2014-04-15 16:06:47 -04:00
Andrew J. Stone
d0c7f2998e Call riak_core_manager:force_update() in test
repl_consistent_object_filter calls riak_ensemble_manager:enable() which
fails to bootstrap the ensemble because the ring has stabilized already.
An issue for this will be opened in riak_kv, but this quick fix will
allow the test to get beyond that point.
2014-04-15 15:49:36 -04:00
Kelly McLaughlin
aca372d627 Initial refactoring to use ensemble_util 2014-04-14 14:09:13 -06:00
Joseph Blomstedt
53e339c913 Merge pull request #573 from basho/jdb-ensemble-sync 2014-04-11 09:39:22 -07:00
Kelly McLaughlin
1651adbd36 Merge branch 'bugfix/fix-verify-busy-dist-port' 2014-04-11 09:14:32 -06:00
Andrew Thompson
b782127edb Shutdown cover when setup_harness is run 2014-04-11 10:23:36 -04:00
Joseph Blomstedt
da7b81b7b3 Merge pull request #569 from basho/jdb-ensemble-overhaul 2014-04-10 23:50:08 -07:00
Joseph Blomstedt
ab5a4a6e4a Add additional ensemble tests to test peer syncing
Add ensemble_basic4, ensemble_sync, and ensemble_interleave tests.

ensemble_sync tests the new AAE-based peer syncing logic. The test
checks various scenarios with different levels of data corruption.

ensemble_interleave tests a specific scenario where two peers become
corrupted one after the other. This tests the scenario where the
second peer becomes untrusted while the first peer may be syncing
with it.
2014-04-10 19:07:02 -07:00
John R. Daily
42265b9d43 Remove unnecessary sleep per @jrwest 2014-04-10 21:53:32 -04:00
John Daily
9d572c24f6 Merge pull request #555 from basho/jrd-security-encoding
Changes for UTF-8 support in users and groups (plus lower-case grant syntax)
2014-04-10 16:59:15 -04:00
Steve Vinoski
6132e22c83 Merge pull request #568 from basho/vinoski/nicer-kills2
further process management changes
2014-04-10 16:48:03 -04:00
Steve Vinoski
25a3b28782 further process management changes
Increase timeout for waiting for init:stop/0 to stop nodes from an extra
second to 10 extra seconds. Be sure to wait until the timeout expires
before using kill on any nodes that fail to stop. To avoid unconditionally
waiting the full timeout period, use kill -0 where possible to watch for
nodes stopping. Use kill -9 only after the full timeout period has elapsed
and the node still hasn't stopped. Fix setting of the cookie when
converting the riak_test node to a distributed node.
2014-04-09 20:55:55 -04:00
Kelly McLaughlin
12cd13a575 Resolve failure of verify_busy_dist_port test on Riak 2.0
As of Riak 2.0 the vm.args zdbbl setting defaults to 32768. Previously
the default of 1024 was used.  Change the cause_bdp helper module for
the verify_busy_dist_port test to be more aggressive in order to
trigger a busy_dist_port message with the higher zdbbl setting.
2014-04-09 16:24:00 -06:00
Kelly McLaughlin
01eef1a556 Merge branch 'refactor/remove-dynammic-ring-force-replace' 2014-04-09 14:23:16 -06:00
Kelly McLaughlin
abad8c44ff Merge branch 'bugfix/replication-test-failures' 2014-04-09 13:58:42 -06:00
Kelly McLaughlin
b991660e67 Remove the force-replace test case from verify_dynamic_ring
There is a race condition that can cause the force-replace test case
in the verify_dynamic_ring test to fail. This issue is being tracked
by riak_core issue #570.  This changes replaces the force-replace
testing with another resize test. Once issue #570 is resolved this
change can be reverted.
2014-04-09 13:36:25 -06:00
Joseph Blomstedt
40f501641c Add basic riak_ensemble tests
Add ensemble_basic, ensemble_basic2, and ensemble_basic3 tests.

These tests test that Riak correctly generates proper consensus
groups, these groups reach quorum, handle leader failures, etc.

ensemble_basic3 tests basic consistent K/V API as well as behavior
during simple network partitions.
2014-04-08 17:41:11 -07:00
Christopher Meiklejohn
dcb7c18461 Adapt to use new clustering API. 2014-04-07 18:06:57 +00:00
Russell Brown
48c5904a3d Merge pull request #565 from basho/rdb/dvv-bucket-prop
Set dvv to true for default bucket under test
2014-04-07 16:52:59 +01:00
Russell Brown
3ee9fe416d Remove extraneous sleep 2014-04-07 10:08:56 +01:00
Kelly McLaughlin
0a36f9974c Fix race conditions and fullsync failures in replication test
Fix some race conditions in the cluster leader helper functions. Also
re-initiate fullsync after a certain number of checks for
completion. V2 replication has problems where calling
riak_repl_console:start_fullsync is basically ignored and needs to be
retried.
2014-04-04 16:10:01 -06:00
Steve Vinoski
93e2a065bf Merge pull request #561 from basho/vinoski/nicer-kills
try to avoid kill -9 for stopping nodes
2014-04-04 15:17:39 -04:00
Steve Vinoski
6b2537fd2c try to avoid kill -9 for stopping nodes
Using kill -9 on a node can leave it in a zombie process state, stuck in a
system call never to return. OS X Mavericks seems especially vunerable to
this problem. Only a reboot can clear out such zombies. Change
rt:brutal_kill/1 to try a normal kill -15 first, and set a 5 second timer
to perform a kill -9 if the normal kill doesn't work. Change
rtdev:stop_all/1 to first try to connect to the nodes to shut them down via
an init:stop/0 rpc, and if that fails attempt to stop them via "riak stop"
instead. Then, ps is used to check for any stragglers and those are killed
via kill -15, wait 5 seconds, kill -9.
2014-04-04 11:05:47 -04:00
Russell Brown
3999e849d8 Set dvv to true for default bucket under test 2014-04-04 14:25:50 +01:00
Christopher Meiklejohn
d8064f10b7 Merge pull request #564 from basho/bugfix/csm/replication2_pg
Adapt test for rtcloud/rtdev.
2014-04-03 16:22:50 -04:00
Christopher Meiklejohn
dd0581fe06 Provide user helper for running all of the tests. 2014-04-03 19:35:14 +00:00
Christopher Meiklejohn
e82ed47639 Ensure that if we have a list of clusters, we log correctly. 2014-04-03 19:25:18 +00:00
Christopher Meiklejohn
6db48fc5d0 Adapt test for rtcloud/rtdev. 2014-04-03 19:17:39 +00:00
Christopher Meiklejohn
afa4eed746 Merge pull request #563 from basho/bugfix/csm/ownership
Improve reliability of test.
2014-04-03 13:29:37 -04:00
Christopher Meiklejohn
c6eb0d4827 Improve reliability of test.
Improve the reliability of the proxy_get test by asserting that
ownership transfer completes before killing the leader node.

Related to basho/riak_repl#352.
2014-04-02 14:58:41 -04:00
Joseph Blomstedt
1b7a65d1fc Merge pull request #544 from basho/feature/rtssh
Add rtcloud support; rtssh harness.
2014-03-31 14:54:05 -07:00
Joseph Blomstedt
5087b8ef2b Delete two spurious comment lines from rt.erl 2014-03-31 14:43:10 -07:00
Andrew Thompson
5ebb2dadac Add a log/test.log that mirrors the console output
This is extremely handy when debugging riak_test issues, since the
actual test logs can omit messages between tests and also be hard to
reassemble.
2014-03-31 13:21:26 -04:00