Commit Graph

1136 Commits

Author SHA1 Message Date
Kelly McLaughlin
9c5daf0f31 Fix failure of replication_ssl test introduced by 297090d
Change replication_ssl to use the wait_for_site_ips function from the
replication module introduced in
297090ded6 instead of the defunct
verify_site_ips function.
2014-07-14 12:37:32 -06:00
Russell Brown
a212b99a75 Update expected return to match change in riak_pb API
See 2b68a97710
for details.
2014-07-14 17:14:51 +01:00
Joseph Blomstedt
695853cc94 Merge pull request #657 from basho/bugfix/ensemble-interleave-error-failed
Fix ensemble_interleave error condition
2014-07-11 21:42:59 -07:00
Andrew J. Stone
f0643db473 Fix ensemble_sync by allowing {error, <<"failed">>
Allow {error, <<"failed">>} as an error response in ensemble_sync. Fixes
the test with basho/riak_ensemble#37 and basho/riak_kv#1002
2014-07-11 18:11:30 -04:00
Andrew J. Stone
38bd8399d1 Fix ensemble_interleave error condition
Include {error, <<"failed">>} as allowed failure so that test passes
with changes for basho/riak_ensemble#37 and basho/riak_kv#1002
2014-07-11 17:41:48 -04:00
Kelly McLaughlin
b59fb48611 Expand the overload test to include strong consistency
Change the overload test to exercise the strongly consistent code
paths in addition to the eventually consistent paths during overload
conditions.
2014-07-11 13:23:48 -06:00
Kelly McLaughlin
297090ded6 Avoid a race condition in the replication test module
Avoid a race condition in the replication test module when checking
for site IP addresses in the replication status output.  The test
waits for a connection on the leader, but it only queries the
replication status to check for the expected site IP addresses a
single time. Change the test to wait and re-check the status output to
give greater assurance that if the expected site IP addresses are not
present it is due to legitimate failure and not a race condition in
checking the replication status. This change affects the replication
and replication_upgrade tests as well as any other tests that call the
replication:replication function.
2014-07-01 16:36:45 -06:00
Jon Anderson
054c015d10 Merge pull request #651 from basho/feature/jra/verify_listkeys_eqcfsm
Expand verify_listkeys_eqcfsm to track varying buckets and n_vals.
2014-07-01 13:19:41 -04:00
Sean Cribbs
17b94da468 Fix nitpick about new/old API from #653. 2014-07-01 08:53:00 -05:00
Christopher Meiklejohn
f085f70169 Merge pull request #653 from basho/features/csm/crdt-capability
Prevent autoreconnect problem.
2014-06-30 18:29:05 -04:00
Christopher Meiklejohn
da34719fe3 Prevent autoreconnect problem.
Prevent a situtation where the auto-reconnect hasn't triggered yet
causing the result to be an error, instead of ok, on the next operation
after reconnecting.  Force a disconnect and reconnect to make sure the
test is deterministic.
2014-06-30 17:22:20 -04:00
Jon Anderson
7c2d7cc827 Expand verify_listkeys_eqcfsm to track varying buckets and n_vals. 2014-06-26 16:13:58 -04:00
Eric Redmond
0eb2d1c443 Merge pull request #650 from basho/er/yz-ensemble-test
Test that ensemble delete functions in yokozuna
2014-06-23 17:38:09 -07:00
Engel A. Sanchez
3662965705 Merge pull request #649 from basho/feature/ensembles-wait-for-riak-kv
Feature/ensembles wait for riak kv
2014-06-23 14:24:12 -04:00
Eric Redmond
266f9858eb Test the ensemble delete function 2014-06-20 14:44:30 -07:00
Engel A. Sanchez
d32d007f4d Fix service/peer check race
Changing to fetching the list of peers first, then check if the riak_kv
service is up. If the service is up, then check the peers. Otherwise it
is possible to see the service down, then peers up because it went up in
the interim.

Also, making KV vnode delay configurable.
2014-06-20 14:26:24 -04:00
Engel A. Sanchez
3bf0954253 Test ensemble peers wait for riak_kv service
Now ensemble peers are prevented from starting up until the riak_kv
service is up to avoid nasty races that could even lead to node crashes
as the ensembles frantically query for data that isn't ready.
2014-06-19 23:26:30 -04:00
Kelly McLaughlin
4b9a77c828 Re-initiate fullsync after a number of failed checks for completion
Re-initiate fullsync after 100 failed checks for completion. The
number of retries of the 'start fullsync and then check for
completion' cycle is configurable using
repl_util:start_and_wait_until_fullsync_complete/4 and defaults to 20
retries. This change is to avoid spurious test failures due to a rare
condition where the rpc call to start fullsync fails to actually
initiate the fullsync. A very similar changed for the version of the
start_and_wait_until_fullsync_complete in the replication module
introduced in 0a36f9974c has had good
effect at avoiding this condition for v2 replication tests.
2014-06-19 14:34:56 -06:00
Kelly McLaughlin
3466aa7c24 Merge branch 'bugfix/fix-repl-object-reformat' 2014-06-18 22:14:35 -06:00
Jon Anderson
c424848bb0 Merge pull request #644 from basho/bugfix/jra/listkeys_eqc_setup
Fix occasional setup errors in verify_listkeys_eqcfsm
2014-06-18 20:29:00 -05:00
Jon Anderson
f3f5e40a36 Removed commented function. 2014-06-18 18:15:30 -05:00
Kelly McLaughlin
f8e10f2f75 Reinstate concurrency in replication_object_reformat test 2014-06-18 15:55:27 -06:00
Kelly McLaughlin
87ee6f5883 Fix replication_object_reformat test failure
Part of the condition checking done in the replication_object_reformat
test is to validate the results of a fullsync using
repl_util:validate_completed_fullsync/6. The way in which the the
function is called from the test expects fullsync to complete with 0
error_exit or retry_exit conditions occurring. This requires that sink
cluster be in a steady state with all partitions available. The test
failed to wait for such conditions to occur and instead relied on
performing a node downgrade asynchronously and waiting for up to 60
seconds for a completion message before continuing with the test. The
test was continually failing after a node was downgraded to `previous`
due to partitions being reported as `down` on that node. To resolve
the issue the node downgrade process is now done in the primary test
process instead of in a separate spawned process. After the version
downgrade is complete, the test now waits for the riak_repl and the
riak_kv services, calls rt:wait_until_nodes_ready/1, calls
rt:wait_until_no_pending_changes/1, and finally waits for the
riak_repl2_fs_node_reserver named process to be registered on the
downgraded node. This process is responsible for handling partition
reservation requests and is key to determining the the new node is
able to handle a fullsync without partition errors.
2014-06-18 15:55:27 -06:00
Kelly McLaughlin
c55e473b97 Merge branch 'feature/update-repl-systest-read-use' 2014-06-18 15:52:48 -06:00
Kelly McLaughlin
2f9a3cae4a Update calls to rt:systest_read to handle identical siblings
Update the calls to rt:systest_read in repl_util and
repl_aae_fullsync_util to treat identical siblings resulting from the
use of DVV as a single value.  These changes are specifically to
address failures seen in the repl_aae_fullsync_custom_n and
replication_object_reformat tests, but should be generally useful for
replication tests using the utility modules that and that have
allow_mult set to true.
2014-06-18 14:33:44 -06:00
Andrew J. Stone
7d0301db35 add intercept for riak_kv_ensemble_backend:handle_down/4 in ensemble_vnode_crash 2014-06-17 23:13:44 -04:00
Andrew J. Stone
6c14c7c371 Add test to kill a vnode and vnode proxy
Kill a vnode and it's proxy for a given key and ensure that operation
reads succeed afterwards.
2014-06-17 17:57:15 -04:00
Jon Anderson
baf32904af Remove un-used clean up function. 2014-06-17 17:26:23 -04:00
Jon Anderson
8912210036 Re-enable AAE. 2014-06-17 17:04:39 -04:00
Jon Anderson
472241f180 Take cluster set up out of a state and instead put it in the property. 2014-06-17 16:49:20 -04:00
John Burwell
6733c099c8 Merge pull request #636 from basho/bugfix/jsb/start-ensemble-without-aae
Verify Riak Startup when Strong Consistency is Misconfigured
2014-06-16 09:30:33 -04:00
Micah
c96f318f6a Merge pull request #643 from basho/bugfix/mw/better-isolate-pb_security-certs
isolate certs created for the pb_security tests.
2014-06-12 17:30:36 -05:00
Micah Warren
f7631b42c3 pb_cipher_suites test creates certs in its own dir.
Same reason as pb_security and http_security: to keep other tests
from stomping on it.
2014-06-12 17:22:42 -05:00
Micah Warren
f96847beb8 isolate certs created for the pb_security tests.
This should prevent other tests from interfering in its execution
2014-06-12 17:18:15 -05:00
Kelly McLaughlin
0589935931 Fix problems with cert specifications causing replication_ssl to fail
Fix problem with cacertdir specification in replication_ssl test. The
code used load cert files in v2 replication expects the path specific
by the cacertdir key to only be a directory. With v3 replication the
code used is flexible enough to allow a directory or a file. Also
correct a typo in the certfile path for the SSLConfig1 configuration.
2014-06-12 12:38:58 -06:00
Kelly McLaughlin
5f5c3ac035 Merge branch 'bugfix/replication-upgrade-fixes' 2014-06-12 10:39:53 -06:00
Kelly McLaughlin
21b64526f1 Fix two issues with replication_upgrade test
* Do not attempt to cancel fullsync if the initial attempt to start
  and wait for completion fails. It has not been observed that the
  problem is fullsync starting and not completing in time, but rather
  the issue is that the initial call to start fullsync does not take
  effect. Therefore the cancellation is unnecessary.
* Replace the call to repl_util:wait_for_connection/2 in the node
  upgrade process with a call to
  replication:wait_until_connection/1. This function is geared towards
  v2 replication and should speed up test execution.
2014-06-11 21:53:29 -06:00
Micah
2c5def132c Merge pull request #638 from basho/bugfix/mw/pb_security-using-removed-function
Fixed map crdt creation
2014-06-11 13:50:09 -05:00
Micah Warren
3067209a97 Fixed map crdt creation
riakc_map:add/2 no longer exists, so updated the creation of that key to
use the correct update semantics.
2014-06-11 13:25:13 -05:00
Kelly McLaughlin
0e2b52d8b1 Fix timing issue with jmx_verify test
Replace use of a 40 second sleep in the test_supervision test case
with a wait condition to better handle variances in the time it takes
to progress through 10 retry attempts.
2014-06-11 11:26:45 -06:00
Kelly McLaughlin
0601cd594f Merge branch 'bugfix/replication-upgrade-return-term' 2014-06-10 17:14:31 -06:00
John Burwell
6d8c504dba - Verifies Riak startup behavior when strong consistency is enabled and
AAE is disabled.  (defect https://github.com/basho/riak_kv/issues/959)
- Adds additional console output to reset-current-env to explain
  configuration and steps being executed
- Adds the -n option to the reset-current-env script to specify the
  number of nodes to build.  By default, 5 will be created.
2014-06-10 15:01:10 -04:00
Kelly McLaughlin
ba4db5ac74 Change replication upgrade tests to return pass on success
As of commit 3044839456 tests that
return something other than the prescribed success atom 'pass' to
indicate success result in test failure. Change the
replication_upgrade and replication2_upgrade tests that return the
result of the a call to lists:foreach/2 to instead return 'pass' to
indicate success.
2014-06-10 12:58:57 -06:00
Joseph Blomstedt
4322795d8f Merge pull request #632 from basho/jdb-auto-ensemble
Update ensemble tests to work with auto-activation
2014-06-06 16:45:43 -07:00
Engel A. Sanchez
cf10cf96a4 Merge pull request #594 from basho/feature/bitcask-tombstone2-upgrade
Verify Bitcask tombstone 2 upgrade
2014-06-06 10:34:15 -04:00
Joseph Blomstedt
65e15a50ad Update ensemble tests to work with auto-activation
Prior to this commit, the various riak_ensemble related tests would
manually enable the consensus system on one-and-only-one node in a
given cluster in order to work around issue basho/riak_core#571.

This commit changes the tests to work properly after the above issue
has been fixed.

In addition to removing the call to riak_ensemble_manager:enable()
that is now handled automatically by Riak, this commit also removes
a few wait_until_stable/2 checks against 1-node clusters. These
checks no longer apply, since Riak is now designed to only enable
the consensus system after the cluster contains at least 3 nodes.
2014-06-05 16:43:12 -07:00
Christopher Meiklejohn
e07c5eb3f6 Merge pull request #626 from basho/bugfix/bjs/riak_repl_cancelled_connections_cleanup
Disconnect and cancelled connection tests
2014-06-05 16:09:34 -04:00
Christopher Meiklejohn
b208394f63 Adapt test with better assertions and new cluster API. 2014-06-05 16:08:08 -04:00
Micah
ce1969d53b Merge pull request #621 from basho/bugfix/mw/pb-cipher-suites-and-http-security
Change http_security uses different directory for certs then pb_cipher_suites
2014-06-05 11:34:28 -05:00
bsparrow435
a5746cb416 Adding re-connection test
Adding onto the existing test to prove that a reconnection to the same
endpoint can occur after the cancelled connection has been removed.
2014-06-04 21:46:57 -04:00
bsparrow435
c89de8dac9 Address PR comments
Changed intercept to explicitly return `{error, econnrefused}`. Moved
helper functions to `repl_util` and added a new helper to distinguish
between disconnects on `cluster_by_name` and `cluster_by_address`
connections.

Added asserts to all wait_for functions.
2014-06-04 19:51:21 -04:00
Joseph Blomstedt
100180e7ff Merge pull request #593 from basho/ajs/ensemble_remove_node 2014-06-04 11:05:34 -07:00
Joseph Blomstedt
f822e52fe8 Extend ensemble_remove_node2 to test shutdown
ensemble_remove_node2 uses an intercept to prevent a riak_ensemble
related transition that is necessary for nodes to completely exit and
shutdown after removal. In fact, testing for this scenario is the
entire point of this test, since it is testing logic that was added to
solve basho/riak_core#572 and that logic prevents nodes from exiting
until that transition occurs.

However, even without this new logic, there is an unrelated
riak_ensemble related bug that can trigger a race condition that also
prevents nodes from shutting down.

The good news is that other changes made as part of the solution to
solve basho/riak_core#572 also fix this unrelated bug. Therefore this
commit extends ensemble_remove_node2 to remove the intercept at the
end of the test and verify that the removed nodes do actually end up
exiting as expected. Thus, the test now tests for both the negative
and positive scenarios and serves as a test against future regressions
that stall node removal/shutdown.
2014-06-03 15:50:06 -07:00
John Burwell
610c1a6ab8 Merge pull request #627 from basho/bugfix/jsb/fix-list-keys-missing-bucket-type-client
Test the operation of list keys and buckets for undefined bucket types
2014-06-03 16:59:47 -04:00
Engel A. Sanchez
27b15c8d37 Merge pull request #630 from basho/feature/wait-for-bucket-props
Add wait until bucket type visible
2014-06-03 16:35:56 -04:00
Evan Vigil-McClanahan
7f0b898e33 Merge pull request #625 from basho/membackend-test
add expanded memory backend tests
2014-06-03 11:50:01 -07:00
Engel A. Sanchez
595c13019d Add wait until bucket type visible
Adding a test to verify a bucket type is visible from a number of nodes
since the active status is given as long as the claimant sees it. But
requests to other nodes can end up hitting the dreaded {error, no_type}.
Also added a general utility that can be used for bucket type checks and
for general verification of bucket properties across nodes.
2014-06-03 14:03:24 -04:00
John Burwell
36741bb977 - Adds tests to verify the operation of list keys and list buckets when
an undefined bucket type is specified.  (defect #875)
- Adds a description of the reset-current-env.sh script and its
  usage to README.md
- Corrects a spelling mistake in an information message emitted by
  the reset-current-env.sh script
2014-06-03 13:44:05 -04:00
Micah Warren
fc13bbcf3c Added cacert to http_security path.
While r16b02-basho5 did not need the cacertfile path put in, r16b03 did.
The test still passes r16b02-basho5 with the added cacertfile line. Since
there is no harm in putting it in, better for forwards compatibility than
not.
2014-06-03 12:41:35 -05:00
Andrew J. Stone
2c7fc9fb4b Merge pull request #628 from basho/bugfix/ensemble-timouts
increase timeouts for SC operations in ensemble tests
2014-06-03 13:29:49 -04:00
Sean Cribbs
3ffab65863 Remove clients tests because they are already running on every commit on builders. 2014-06-03 11:44:07 -05:00
Micah
b14d093ce5 Merge pull request #589 from basho/vinoski/http_security_cluster
remove localhost dependencies in http_security test
2014-06-03 11:31:00 -05:00
evan
b431ba19a1 reorder tests to avoid test-breaking side-effects 2014-06-02 15:23:33 -07:00
evan
0295115e14 fix for ee versions 2014-06-02 13:53:35 -07:00
evan
bdeb8a4138 fix coyote error 2014-06-02 13:40:21 -07:00
Andrew J. Stone
a35be4bf28 increase timeouts for SC operations in ensemble tests 2014-06-02 16:40:19 -04:00
evan
131ee7bf2f add debugging info 2014-06-02 13:25:44 -07:00
Micah Warren
2b6c86b8bc Removed reference to outdated add-map feature. 2014-06-02 15:18:21 -05:00
evan
492f52ce6c try to make the eviction check fully deterministic 2014-06-02 11:01:17 -07:00
bsparrow435
941efca4ac Disconnect and cancelled connection tests
Adding tests and intercepts for connection_mgr to test disconnect and
cancelled connection cleanup.
2014-06-02 13:03:11 -04:00
Sean Cribbs
98b180ae4c Merge pull request #611 from basho/feature-bug/rdb/removes-remove
Update dt test for reset-remove semantic
2014-06-01 15:42:10 -05:00
evan
5d5cdce079 add expanded memory backend tests 2014-05-31 09:17:41 -07:00
Jon Anderson
27390fb2e2 Merge pull request #624 from basho/bugfix/jra/rex
Add test for safe_rpc when rex is down
2014-05-30 16:44:35 -04:00
Jon Anderson
110bf7e1ab Restart rex and make sure rpc works again. 2014-05-30 15:44:06 -04:00
Jon Anderson
8623a7415c Add riak_test to exercise riak_core_util:safe_rpc rex problem fix. 2014-05-30 14:55:34 -04:00
Engel A. Sanchez
177f901f50 Merge pull request #623 from basho/bugfix/bucket-types-and-mr-key-filter
Verify bucket type + MR filters fix
2014-05-30 13:07:03 -04:00
Engel A. Sanchez
7d8eae7386 Verify bucket type + MR filters fix 2014-05-30 12:00:20 -04:00
Kelly McLaughlin
ce401b0b0d Merge branch 'bugfix/replication2-race' 2014-05-30 09:23:54 -06:00
Christopher Meiklejohn
b1752e0a26 Merge pull request #617 from basho/feature/csm/location-down
Add ability to test the location_down behavior.
2014-05-30 10:49:59 -04:00
Kelly McLaughlin
d82e65c633 Fix a race condition in replication2 test
Remove an assertion based on reading keys a single time after realtime
replication is re-enabled in the test. Instead just rely on the wait
condition that already followed the assertion to read and verify the
same keys.
2014-05-29 16:56:21 -06:00
Micah Warren
c313638cb4 Using different directory for certs than pb_cipher_suites
There are different cert chains for pb_cipher_suites and http_security.
The certs were not fully cleaned between tests, so it would cause the
test to fail. By just using a different directory to put the certs into,
it better isolates the tests.
2014-05-29 13:48:54 -05:00
Micah Warren
ae30256d20 Fixed return from confirm so test acutally passes. 2014-05-29 13:48:38 -05:00
Sean Cribbs
ef56dadad1 Merge pull request #610 from basho/bugfix/sdc/riak-api-61-error-no-type
Check for the correct bucket props response on non-existent type
2014-05-29 12:11:02 -05:00
Christopher Meiklejohn
a07018cb4c Merge pull request #618 from basho/feature/csm/cancel-fullsync
Ensure we can cancel a fullsync and restart.
2014-05-28 21:34:38 +02:00
Andrew J. Stone
6cd649d5ab Ensure keylist pids are stopped when fs cancelled
When fullsync is cancelled, we want to make sure that the keylist pids
it was using actually go away. Add that code to repl_cancel_fullsync.
2014-05-28 15:23:41 -04:00
Christopher Meiklejohn
236ad884f9 Verify that after stopping, no sources running. 2014-05-28 14:51:54 +02:00
Christopher Meiklejohn
6cebbd371a Add ability to test the location_down behavior.
Assert that we properly handle retries of failed partitions, when the remote location happens to be down.
2014-05-28 13:59:23 +02:00
Andrew J. Stone
8c3beedcc8 Make repl_cancel_fullsync more robust
* add repl_util:wait_until_fullsync_started/1
 * add repl_util:wait_until_fullsync_stopped/1
 * remove timeouts and use above calls to confirm our test is in the
   right state
2014-05-27 17:41:15 -04:00
Andrew J. Stone
59ad7841ba add stop command and read assertion to repl_cancel_fullsync 2014-05-27 13:04:19 -04:00
Christopher Meiklejohn
dfdfb51739 Ensure we can cancel a fullsync and restart. 2014-05-26 23:03:04 +02:00
Kelly McLaughlin
19ae3dee6c Merge pull request #600 from basho/bug/er/search-switch-btype
Update mapred_search_switch with new behaviors
2014-05-23 10:44:17 -06:00
Kelly McLaughlin
d7da9f1711 Merge pull request #608 from basho/bug/mw/repl2-allow-mult
Removed 'allow_mult false' for test.
2014-05-22 16:51:48 -06:00
Kelly McLaughlin
bc52f94bda Merge branch 'refactor/change-ok-test-returns' 2014-05-22 16:35:08 -06:00
John R. Daily
429064e18b Add tests for bugs reported against riak_core: 917, 918 2014-05-22 18:18:32 -04:00
Kelly McLaughlin
1f538d7ee0 Update tests that return something other than pass for success
As of commit 3044839456 tests that
return something other than the prescribed success atom 'pass' to
indicate success result in test failure. Change tests that return the
atom 'ok' or some other value to instead return 'pass' to indicate
success.
2014-05-22 15:54:23 -06:00
Ryan Zezeski
3044839456 Merge pull request #613 from basho/rz/no-fails
A return of `fail` should actually report failure
2014-05-21 18:29:53 -04:00
rzezeski
f78de19560 Add always_fail_test
Add a test that always fails to make sure that the pass/fail logic is
working.
2014-05-21 17:13:27 +00:00
Russell Brown
8d2daa62e6 Update dt test for reset-remove semantic
And dopping of "add" fro Map API
2014-05-20 14:41:15 +01:00
Sean Cribbs
6cfdf26a4b Check for props on non-existent bucket type. See basho/riak_api#62 2014-05-19 15:21:49 -05:00
Micah Warren
43ae4e2871 Removed 'allow_mult false' for test.
This reverses an earlier change to support a feature that has been stripped
(for now). When said feature is put back in, it should support multi.
Setting this to allow mutlt = true allows for more confidence in tests.
2014-05-19 12:48:08 -05:00
Sean Cribbs
c2c97efb82 Use the wrapper function for getting a PB connection. 2014-05-19 11:32:02 -05:00
Sean Cribbs
3476bf4d7f Added check for list-keys and 2i security on non-default bucket-types. 2014-05-17 12:46:23 -07:00
Joe DeVivo
cfd473ad23 Merge pull request #603 from basho/refactor/confbal/ssl
Using riak_core ssl app settings instead of riak_api
2014-05-13 13:48:55 -07:00
Steve Vinoski
6e7bbb8b27 remove localhost dependencies in http_security test
Make sure the http_security test works when run on an actual cluster under
rtcloud.
2014-05-13 15:48:09 -04:00
Joe DeVivo
d654a438d7 Using riak_core ssl app settings instead of riak_api 2014-05-13 09:04:34 -07:00
Jon Anderson
6fd07010d7 Add EQC ifdefs. 2014-05-13 11:44:45 -04:00
Engel A. Sanchez
bda1b5c3cf Merge pull request #543 from basho/refactor/one-wait-4-aae-trees
Merge repl and rt versions of wait until AAE trees build
2014-05-12 15:54:36 -04:00
Engel A. Sanchez
31588c5d22 Replacing repl_util AAE wait with rt version 2014-05-12 14:55:59 -04:00
Andrew Thompson
29def51378 Merge pull request #601 from basho/adt-use-make_certs
Switch all the selfsigned certificates to be generated on demand
2014-05-09 22:41:07 -04:00
Andrew Thompson
6c4afcbcde Switch all the selfsigned certificates to be generated on demand
Using the make_cert tool we can generate arbitrary certificate chains on
demand, so they never have to be regenerated.
2014-05-09 14:46:52 -04:00
Jon Anderson
79a0434c3d Merge pull request #599 from basho/feature/jra/listkeys-eqcfsm
Eqc-fsm version of listkeys statem test
2014-05-09 13:58:59 -04:00
Jon Anderson
0daccbbb81 Refactor test.
- Move noshrink to g_uuid.
- Remove dead code.
2014-05-09 12:27:50 -04:00
Jon Anderson
fe8ffc6c6c Add noshrink call to g_uuid() so avoid shrinking on uuid generation. 2014-05-09 12:01:05 -04:00
Jon Anderson
576f8961d2 Remove unnecessary flags from State. 2014-05-09 11:15:41 -04:00
Jon Anderson
4ece10f424 Complete basic test.
- Fix single bucket handling.
- Remove unused preconditions from statem test.
- Add postconditions for verification.
2014-05-09 10:50:13 -04:00
Eric Redmond
0f6cda672b Bucket type missing from MR verify, search with wildcard 2014-05-08 21:38:41 -07:00
Kelly McLaughlin
acd0f3fbe6 Merge branch 'bugfix/verify-busy-dist-port-hang' 2014-05-08 16:18:43 -06:00
Jon Anderson
5a78809cde Rename states to be more sensible. 2014-05-08 15:25:38 -04:00
Christopher Meiklejohn
de0f50fd86 Merge pull request #597 from basho/bugfix/csm/riak_control
Resolve riak_control failures.
2014-05-08 12:40:43 -04:00
Christopher Meiklejohn
32b6a598e9 Resolve riak_control failures.
Resolve failures with cuttlefish configuration changes in Riak 2.0.

Remove riak_control_upgrade, since riak_control should cover those use
cases completely.
2014-05-08 12:40:13 -04:00
Jon Anderson
3d4a836915 Added druuid depenency to fix g_uuid() failure in
verify_listkeys_eqcfsm.
2014-05-07 11:35:34 -04:00
Jon Anderson
6131a7be5e Fixed post-test State handling for cleanup. 2014-05-06 17:28:12 -04:00
Kelly McLaughlin
0de33c7b66 Resolve issue with verify_busy_dist_port test hanging indefinitely
The verify_busy_dist_port helper function cause_bdp:spam_nodes/1
recently changed to be more aggressive in triggering busy_dist_port
warnings. The function changed to spawn 1 million processes to ensure
the test generated enough activity to trigger the warnings, but that
number of processes exceeds the 256 thousand process limit that is the
Riak default. One consequence of this can be that the rex server
responsible for handling rpc calls can crash. In some cases this leads
to rpc calls by riak_test to shutdown the riak nodes involved in the
test to hang indefinitely. This change reduces the number of processes
spawned to 200 thousand. This should still be enough processes to
trigger the busy_dist_port warnings, but without exceeding the beam
process limit.
2014-05-06 14:40:47 -06:00
Jon Anderson
6ce90f87be Eqc-fsm version of verify_listkeys_eqcfsm. 2014-05-06 15:28:28 -04:00
Joe DeVivo
ffadda64a2 Fixed all hardcoded references to 127.0.0.1 in replication2_pg 2014-05-01 15:38:02 -06:00
Joe DeVivo
004de13036 potential fix for replication2_pg:test_mixed_pg_ssl 2014-05-01 15:38:02 -06:00
Joe DeVivo
60036e57f4 replication2_pg fixes for rtcloud 2014-05-01 15:30:32 -06:00
Kelly McLaughlin
f9cf5f840c Merge branch 'bugfix/repl-upgrade-wait-for-rt-conn' 2014-04-25 08:30:46 -06:00
Kelly McLaughlin
2e0991f9e2 Merge branch 'bugfix/verify-counter-capabilities-failure' 2014-04-24 22:02:43 -06:00
Kelly McLaughlin
e2a1ccd653 Wait for realtime connections after node upgrades in replication tests
Prior to Riak 1.4.8 replication registers as a service prior to
completing all initialization tasks including establishing realtime
connections to sink clusters. This leads to a race condition in the
replication_upgrade and replication2_upgrade tests where the test may
begin writing data to the source cluster to verify the function of
realtime replication before the most recently upgraded node
establishes a connection to the sink cluster. The result of this is
that the data is silently discarded by the realtime replication system
and the test fails because all of the expected data is not replicated
and able to be read on the sink cluster. Change the
replication_upgrade and replication2_upgrade tests to explicitly wait
for the realtime connection to be established after each source
cluster node is upgraded before proceeding with the test.
2014-04-24 21:43:05 -06:00
Engel A. Sanchez
1ed2120668 Verify Bitcask tombstone 2 upgrade
Verify that upgrading Riak with Bitcask to 2.0 or later will trigger
an upgrade mechanism that will end up merging all existing bitcask
files.  This is necessary so that old style tombstones are reaped,
which might otherwise stay around for a very long time. This version
writes tombstones that can be safely dropped during a merge. Bitcask
could resurrect old values easily when reaping tombstones during a
 partial merge if a restart happened later.
2014-04-24 18:08:40 -04:00
Christopher Meiklejohn
bd0721ec32 Merge pull request #588 from basho/feature/csm/repl-stats-1
Add basic moving target stats test.
2014-04-24 12:00:02 +01:00
Christopher Meiklejohn
20fb678e17 Merge pull request #587 from basho/feature/csm/downgrade-rt
Refactor test to assert downgrade.
2014-04-24 11:19:01 +01:00
Kelly McLaughlin
c5cab6a2b2 Merge branch 'bugfix/replication2-pg-test-pg-proxy' 2014-04-23 14:56:56 -06:00
Andrew J. Stone
eb9bc44c66 prevent non-deterministic failures in ensemble_remove_node2
Add a wait_untiL_stable call right before the last read from the root
ensemble.
2014-04-23 16:11:07 -04:00
Andrew J. Stone
0cb1bd4357 check ring state for ensemble_remove_node 2014-04-23 15:47:43 -04:00
Kelly McLaughlin
6bb72e2747 Avoid failure in verify_counter_capabilities after node upgrade
Establish a new PB connection to the legacy node after it is upgraded
in order to avoid a failure. The PB connection may close if the node
upgrade takes too long and its reuse in such a case can lead to test
failure because use of the pid returns {error, disconnected} errors.
2014-04-23 09:49:15 -06:00
Kelly McLaughlin
274fc6648b Merge branch 'er/verify_dt_upgrade' 2014-04-23 06:57:32 -06:00
Kelly McLaughlin
f045c2e093 Call verify_counters during the upgrade 2014-04-23 06:55:06 -06:00
Andrew J. Stone
f171095ad2 A second test for node removal / kv ensemble changes 2014-04-23 00:08:04 -04:00
Andrew J. Stone
3a398b3010 ensemble_remove nodes k/v ensemble checks
Ensure that k/v ensembles reconfigure to exclude a leaving node before
that node transitions to exiting status.
2014-04-22 19:21:32 -04:00
Christopher Meiklejohn
09f7f88776 Add basic moving target stats test.
Use this as a platform to start testing reports of missing stats from
replication.
2014-04-22 10:38:33 +01:00
Christopher Meiklejohn
07b91fab36 Refactor test to assert downgrade.
When performing the test of object reformatting through replication,
assert that if we happen to downgrade the format we can still read the
keys which have been replicated.
2014-04-20 16:09:43 +00:00
Andrew J. Stone
d1420f6482 remove outdated print statements 2014-04-18 16:15:34 -04:00
Andrew J. Stone
2deb46b1e5 Test cluster state and root ensemble members after node leaving in ensemble_remove_node 2014-04-18 16:06:37 -04:00
Kelly McLaughlin
823dd2fe9c Wait for transfers to complete in replication2_pg:test_pg_proxy
Wait for transfers to complete in
replication2_pg:test_pg_proxy. Replication tests that test the n_val=1
request option can fail with insufficient_vnodes errors if the cluster
setup does not include waiting for transfers to complete. Change the
test_pg_proxy test case to wait until transfers complete on the "A"
and "B" clusters before proceeding.
2014-04-17 11:05:41 -06:00
Kelly McLaughlin
be61a1c338 Fix an error that can lead to failure of tests using replication2_pg
Fix an error that can lead to failure of tests using
replication2_pg:test_pg_proxy test case. A protocol buffers connection
is established to a node in the "B" cluster, the leader node from that
cluster is shut down, and then that protocol buffers connection is
used to exercise proxy_get. If the connection was established to the
former leader and that is subsequently shut down it can cause the test
to stall and eventually fail. This changes that test to establish a
new connection to a node remaining in the "B" cluster to use for the
proxy_get and prevents the test from stalling.
2014-04-16 17:38:18 -06:00
Andrew J. Stone
106f0608ae add ensemble_remove_node 2014-04-16 17:15:19 -04:00
John Daily
aa91baf273 Merge pull request #549 from basho/jrd-bucket-types-naming
Update tests for UTF-8 bucket types (and buckets)
2014-04-16 11:59:59 -04:00
Kelly McLaughlin
5b66c7a354 Add n_val to bucket type specificaitons 2014-04-15 15:08:23 -06:00