Commit Graph

1011 Commits

Author SHA1 Message Date
Kelly McLaughlin
87ee6f5883 Fix replication_object_reformat test failure
Part of the condition checking done in the replication_object_reformat
test is to validate the results of a fullsync using
repl_util:validate_completed_fullsync/6. The way in which the the
function is called from the test expects fullsync to complete with 0
error_exit or retry_exit conditions occurring. This requires that sink
cluster be in a steady state with all partitions available. The test
failed to wait for such conditions to occur and instead relied on
performing a node downgrade asynchronously and waiting for up to 60
seconds for a completion message before continuing with the test. The
test was continually failing after a node was downgraded to `previous`
due to partitions being reported as `down` on that node. To resolve
the issue the node downgrade process is now done in the primary test
process instead of in a separate spawned process. After the version
downgrade is complete, the test now waits for the riak_repl and the
riak_kv services, calls rt:wait_until_nodes_ready/1, calls
rt:wait_until_no_pending_changes/1, and finally waits for the
riak_repl2_fs_node_reserver named process to be registered on the
downgraded node. This process is responsible for handling partition
reservation requests and is key to determining the the new node is
able to handle a fullsync without partition errors.
2014-06-18 15:55:27 -06:00
Kelly McLaughlin
c55e473b97 Merge branch 'feature/update-repl-systest-read-use' 2014-06-18 15:52:48 -06:00
Kelly McLaughlin
2f9a3cae4a Update calls to rt:systest_read to handle identical siblings
Update the calls to rt:systest_read in repl_util and
repl_aae_fullsync_util to treat identical siblings resulting from the
use of DVV as a single value.  These changes are specifically to
address failures seen in the repl_aae_fullsync_custom_n and
replication_object_reformat tests, but should be generally useful for
replication tests using the utility modules that and that have
allow_mult set to true.
2014-06-18 14:33:44 -06:00
Andrew J. Stone
7d0301db35 add intercept for riak_kv_ensemble_backend:handle_down/4 in ensemble_vnode_crash 2014-06-17 23:13:44 -04:00
Andrew J. Stone
6c14c7c371 Add test to kill a vnode and vnode proxy
Kill a vnode and it's proxy for a given key and ensure that operation
reads succeed afterwards.
2014-06-17 17:57:15 -04:00
John Burwell
6733c099c8 Merge pull request #636 from basho/bugfix/jsb/start-ensemble-without-aae
Verify Riak Startup when Strong Consistency is Misconfigured
2014-06-16 09:30:33 -04:00
Micah
c96f318f6a Merge pull request #643 from basho/bugfix/mw/better-isolate-pb_security-certs
isolate certs created for the pb_security tests.
2014-06-12 17:30:36 -05:00
Micah Warren
f7631b42c3 pb_cipher_suites test creates certs in its own dir.
Same reason as pb_security and http_security: to keep other tests
from stomping on it.
2014-06-12 17:22:42 -05:00
Micah Warren
f96847beb8 isolate certs created for the pb_security tests.
This should prevent other tests from interfering in its execution
2014-06-12 17:18:15 -05:00
Kelly McLaughlin
0589935931 Fix problems with cert specifications causing replication_ssl to fail
Fix problem with cacertdir specification in replication_ssl test. The
code used load cert files in v2 replication expects the path specific
by the cacertdir key to only be a directory. With v3 replication the
code used is flexible enough to allow a directory or a file. Also
correct a typo in the certfile path for the SSLConfig1 configuration.
2014-06-12 12:38:58 -06:00
Kelly McLaughlin
5f5c3ac035 Merge branch 'bugfix/replication-upgrade-fixes' 2014-06-12 10:39:53 -06:00
Kelly McLaughlin
21b64526f1 Fix two issues with replication_upgrade test
* Do not attempt to cancel fullsync if the initial attempt to start
  and wait for completion fails. It has not been observed that the
  problem is fullsync starting and not completing in time, but rather
  the issue is that the initial call to start fullsync does not take
  effect. Therefore the cancellation is unnecessary.
* Replace the call to repl_util:wait_for_connection/2 in the node
  upgrade process with a call to
  replication:wait_until_connection/1. This function is geared towards
  v2 replication and should speed up test execution.
2014-06-11 21:53:29 -06:00
Micah
2c5def132c Merge pull request #638 from basho/bugfix/mw/pb_security-using-removed-function
Fixed map crdt creation
2014-06-11 13:50:09 -05:00
Micah Warren
3067209a97 Fixed map crdt creation
riakc_map:add/2 no longer exists, so updated the creation of that key to
use the correct update semantics.
2014-06-11 13:25:13 -05:00
Kelly McLaughlin
0e2b52d8b1 Fix timing issue with jmx_verify test
Replace use of a 40 second sleep in the test_supervision test case
with a wait condition to better handle variances in the time it takes
to progress through 10 retry attempts.
2014-06-11 11:26:45 -06:00
Kelly McLaughlin
0601cd594f Merge branch 'bugfix/replication-upgrade-return-term' 2014-06-10 17:14:31 -06:00
John Burwell
6d8c504dba - Verifies Riak startup behavior when strong consistency is enabled and
AAE is disabled.  (defect https://github.com/basho/riak_kv/issues/959)
- Adds additional console output to reset-current-env to explain
  configuration and steps being executed
- Adds the -n option to the reset-current-env script to specify the
  number of nodes to build.  By default, 5 will be created.
2014-06-10 15:01:10 -04:00
Kelly McLaughlin
ba4db5ac74 Change replication upgrade tests to return pass on success
As of commit 3044839456 tests that
return something other than the prescribed success atom 'pass' to
indicate success result in test failure. Change the
replication_upgrade and replication2_upgrade tests that return the
result of the a call to lists:foreach/2 to instead return 'pass' to
indicate success.
2014-06-10 12:58:57 -06:00
Joseph Blomstedt
4322795d8f Merge pull request #632 from basho/jdb-auto-ensemble
Update ensemble tests to work with auto-activation
2014-06-06 16:45:43 -07:00
Engel A. Sanchez
cf10cf96a4 Merge pull request #594 from basho/feature/bitcask-tombstone2-upgrade
Verify Bitcask tombstone 2 upgrade
2014-06-06 10:34:15 -04:00
Joseph Blomstedt
65e15a50ad Update ensemble tests to work with auto-activation
Prior to this commit, the various riak_ensemble related tests would
manually enable the consensus system on one-and-only-one node in a
given cluster in order to work around issue basho/riak_core#571.

This commit changes the tests to work properly after the above issue
has been fixed.

In addition to removing the call to riak_ensemble_manager:enable()
that is now handled automatically by Riak, this commit also removes
a few wait_until_stable/2 checks against 1-node clusters. These
checks no longer apply, since Riak is now designed to only enable
the consensus system after the cluster contains at least 3 nodes.
2014-06-05 16:43:12 -07:00
Christopher Meiklejohn
e07c5eb3f6 Merge pull request #626 from basho/bugfix/bjs/riak_repl_cancelled_connections_cleanup
Disconnect and cancelled connection tests
2014-06-05 16:09:34 -04:00
Christopher Meiklejohn
b208394f63 Adapt test with better assertions and new cluster API. 2014-06-05 16:08:08 -04:00
Micah
ce1969d53b Merge pull request #621 from basho/bugfix/mw/pb-cipher-suites-and-http-security
Change http_security uses different directory for certs then pb_cipher_suites
2014-06-05 11:34:28 -05:00
bsparrow435
a5746cb416 Adding re-connection test
Adding onto the existing test to prove that a reconnection to the same
endpoint can occur after the cancelled connection has been removed.
2014-06-04 21:46:57 -04:00
bsparrow435
c89de8dac9 Address PR comments
Changed intercept to explicitly return `{error, econnrefused}`. Moved
helper functions to `repl_util` and added a new helper to distinguish
between disconnects on `cluster_by_name` and `cluster_by_address`
connections.

Added asserts to all wait_for functions.
2014-06-04 19:51:21 -04:00
Joseph Blomstedt
100180e7ff Merge pull request #593 from basho/ajs/ensemble_remove_node 2014-06-04 11:05:34 -07:00
Joseph Blomstedt
f822e52fe8 Extend ensemble_remove_node2 to test shutdown
ensemble_remove_node2 uses an intercept to prevent a riak_ensemble
related transition that is necessary for nodes to completely exit and
shutdown after removal. In fact, testing for this scenario is the
entire point of this test, since it is testing logic that was added to
solve basho/riak_core#572 and that logic prevents nodes from exiting
until that transition occurs.

However, even without this new logic, there is an unrelated
riak_ensemble related bug that can trigger a race condition that also
prevents nodes from shutting down.

The good news is that other changes made as part of the solution to
solve basho/riak_core#572 also fix this unrelated bug. Therefore this
commit extends ensemble_remove_node2 to remove the intercept at the
end of the test and verify that the removed nodes do actually end up
exiting as expected. Thus, the test now tests for both the negative
and positive scenarios and serves as a test against future regressions
that stall node removal/shutdown.
2014-06-03 15:50:06 -07:00
John Burwell
610c1a6ab8 Merge pull request #627 from basho/bugfix/jsb/fix-list-keys-missing-bucket-type-client
Test the operation of list keys and buckets for undefined bucket types
2014-06-03 16:59:47 -04:00
Engel A. Sanchez
27b15c8d37 Merge pull request #630 from basho/feature/wait-for-bucket-props
Add wait until bucket type visible
2014-06-03 16:35:56 -04:00
Evan Vigil-McClanahan
7f0b898e33 Merge pull request #625 from basho/membackend-test
add expanded memory backend tests
2014-06-03 11:50:01 -07:00
Engel A. Sanchez
595c13019d Add wait until bucket type visible
Adding a test to verify a bucket type is visible from a number of nodes
since the active status is given as long as the claimant sees it. But
requests to other nodes can end up hitting the dreaded {error, no_type}.
Also added a general utility that can be used for bucket type checks and
for general verification of bucket properties across nodes.
2014-06-03 14:03:24 -04:00
John Burwell
36741bb977 - Adds tests to verify the operation of list keys and list buckets when
an undefined bucket type is specified.  (defect #875)
- Adds a description of the reset-current-env.sh script and its
  usage to README.md
- Corrects a spelling mistake in an information message emitted by
  the reset-current-env.sh script
2014-06-03 13:44:05 -04:00
Micah Warren
fc13bbcf3c Added cacert to http_security path.
While r16b02-basho5 did not need the cacertfile path put in, r16b03 did.
The test still passes r16b02-basho5 with the added cacertfile line. Since
there is no harm in putting it in, better for forwards compatibility than
not.
2014-06-03 12:41:35 -05:00
Andrew J. Stone
2c7fc9fb4b Merge pull request #628 from basho/bugfix/ensemble-timouts
increase timeouts for SC operations in ensemble tests
2014-06-03 13:29:49 -04:00
Sean Cribbs
3ffab65863 Remove clients tests because they are already running on every commit on builders. 2014-06-03 11:44:07 -05:00
Micah
b14d093ce5 Merge pull request #589 from basho/vinoski/http_security_cluster
remove localhost dependencies in http_security test
2014-06-03 11:31:00 -05:00
evan
b431ba19a1 reorder tests to avoid test-breaking side-effects 2014-06-02 15:23:33 -07:00
evan
0295115e14 fix for ee versions 2014-06-02 13:53:35 -07:00
evan
bdeb8a4138 fix coyote error 2014-06-02 13:40:21 -07:00
Andrew J. Stone
a35be4bf28 increase timeouts for SC operations in ensemble tests 2014-06-02 16:40:19 -04:00
evan
131ee7bf2f add debugging info 2014-06-02 13:25:44 -07:00
Micah Warren
2b6c86b8bc Removed reference to outdated add-map feature. 2014-06-02 15:18:21 -05:00
evan
492f52ce6c try to make the eviction check fully deterministic 2014-06-02 11:01:17 -07:00
bsparrow435
941efca4ac Disconnect and cancelled connection tests
Adding tests and intercepts for connection_mgr to test disconnect and
cancelled connection cleanup.
2014-06-02 13:03:11 -04:00
Sean Cribbs
98b180ae4c Merge pull request #611 from basho/feature-bug/rdb/removes-remove
Update dt test for reset-remove semantic
2014-06-01 15:42:10 -05:00
evan
5d5cdce079 add expanded memory backend tests 2014-05-31 09:17:41 -07:00
Jon Anderson
27390fb2e2 Merge pull request #624 from basho/bugfix/jra/rex
Add test for safe_rpc when rex is down
2014-05-30 16:44:35 -04:00
Jon Anderson
110bf7e1ab Restart rex and make sure rpc works again. 2014-05-30 15:44:06 -04:00
Jon Anderson
8623a7415c Add riak_test to exercise riak_core_util:safe_rpc rex problem fix. 2014-05-30 14:55:34 -04:00